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ABSTRACT 


The first part of this thesis considers the complexity of Boolean 
functions. The main complexity measures used are the number of gates in 
combinational networks and the size of Boolean formulas. The case of 
monotone realizations, using only the operations of AND and OR, of monotone 
functions is emphasized. | 

For a particular class of monotone functions, the quadratic onccione 
the worst-case values for the monotone circuit complexity is shown to be 
proportional to n“/log n. The number of a-gates necessary to compute any 
quadrastic function is also analyzed. 

A technique for deriving bounds on monotone circuit size of threshold 
functions is applied to the "majority" function (threshold n/2) to establish 
a lower bound on its monotone circuit complexity of 3n-0(1). For the function 
"threshold 2", previously known lower bounds on the number of v-gates required 


are extended in the case of a circuit which has a minimal number of a-gates. 
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As a result, it follows that no monotone circuit for “threshold 2" can 
simultaneously have both a minimal number of a-gates and a minimal number 
of v-gates. 

The complexity of combinations of functions eh dissoint: sete of variables 
is studied, and a gap between the formula and circuit size of a particular 
function is given. 

Finally, we study the effect of allowing negation in a formula for mono- 
tonic functions. Examples are given both of functions in which using negations 
allows more succinct expressions, and functions in which it does not. 

The second part of the thesis describes an algorithm for computing shortest 
paths in a graph. These results show that an algorithm originally proposed 
by Spira for this problem can have slow running time. The lacuna in his 
algorithm is repaired, and it is shown to have 0(n@(iog n)*) average running 
time over wide classes of graphs as Spira originally claimed. As a special 
case, a transitive closure algorithm with 0(n? log n) average time is also 


described. 


Thesis Supervisor: | Albert R. Meyer, Professor of Computer Science. 


Key Words: Boolean functions, circuit complexity, Boolean formula size, 
monotone networks, threshold functions, quadratic functions, 
algorithms, shortest paths, directed graphs, transitive closure, 
computational complexity. 
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CHAPTER 1 


Introduction 


This thesis consists of two parts, each devoted to a different 
area in the theory of computation. It is primarily devoted to a 
study of the complexity of Boolean functions, with an emphasis on 


monotone Boolean functions. A secondary objective is the analysis 


. of algorithms which compute the shortest distances between all points 


in a graph and which compute the transitive closure of a Boolean 
matrix. 

In general, computational complexity theory asks questions 
about the computational resources required to solve a problem or 
compute a function. Traditional complexity theory has been pri- 
marily concerned with the difficulty of computing recursive functions 
on various models of machines such as Turing Machines or Random 
Access Machines; measures such as time ‘or space on these machines 
are considered. Finite functions (i.e. those with finite domain), 
since they are computable by finite state machines, are inherently 
"easy" according to these definitions since they require only enough time 
and space to read the input and print the output.: Several models 
of computing machines for finite functions have been proposed over 
the past years, and these have been studied as a means for defining 
the complexity of finite functions. Some of these, such as 


circuit complexity, receive their motivation from computer technol- 
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ogy, while others such as formula size-are more mathematical in 
origin. In the work which has been. done so far, these models are 
seen to have a rich mathematical structure which is only now begin- 
ning to be understood to any extent. The bulk of the research con- 
tained in this thesis is a study of several problems in this area 
of finite computation. 

We restrict ourselves to the Boolean case in which the finite 
domain and range are vector spaces over’ the set {0,1}. The original 
work in this area was directed towards finding asymptotic results 
about the worst-case complexity of al] Boolean functions on n 
variables. Work of Shannon [1949], Lupanov [1958,1962], and others: 
[Krichevskii 1961] established that “most” Boolean functions on 
n variables have minimal formulas of size asymptotic with 2" 710g n,t 
and minimal circuits of size asymptotic with 2"/n. During the 
1950's attention was focused more on individual "practical" problems, 
and on finding optimal or near-optimal canonical formulas and 
circuits for a function with a given input-output specification. 
Work of Quine [1952,1955] and others [McCluskey 1956, Karnaugh 1953] 
explored methods for finding efficient ways.of computing or expres- 
sing a specific function. 

Recent research has been directed primarily at finding exact 
values for the complexity of individual functions in different 


models of complexity. As an adjunct to this pursuit, other more 


t In this report, all logarithms are to the base 2. 
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- basic questions such as the relationship between complexity in 


different models have.been studied. This search for optimal 
circuits and formulas is only indirectly motivated from an engi- 
neering point of view’, but is relevant to complexity theory. 
Several authors [Pippenger and Fischer. 1977, Schnorr 1976d] 
have demonstrated a connection between the Turing Machine complexity 
of a function and the circuit complexity of finite restrictions of 
that function. These results essentially show that a function is 
easy to compute with respect to oracle Turing Machine time if and 
only if it has a small circuit on each set of finite length inputs. 
This work relates to recent studies of non-deterministic polyno- 
mial time (NP) and to the NP-complete problems of Cook [1971] and 
Karp [1972] because it implies that if P= NP, then the finite 
restriction of any problem in wP has a polynomial-sized circuit. 
While this latter question is obviously still open, this observation — 
has given impetus to many researchers. to explore the possibility of 
proving large lower bounds on the complexity of specific Boolean 


functions. 


t 
Considerations such as the number of gates in a circuit or the 


delay time are not as important as other factors such as the inter- 


connections between gates and the fan-out of the gates. 
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The results obtained thus far have not been very encouraging 


in pursuit. of this goal. The lower bounds obtained may be divided 
into two types. Diagonalstype arguments can obtain exponential 
lower bounds on the complexity of specific functions. For example, 
Stockmeyer [1974] and Meyer have shown that, sfnce there are functions 
which are recognizable on a deterministic Turing Machine in space Qn 
whose restrictions to inputs of fixed length have exponential circuit 
complexity, one can get a lower bound of :¢” on any circuit which 
computes a restriction of certain logical languages. Thus far, 
results of. this sort have been limited to Boolean functions which 
are binary codings of problems which are provably difficult to compute 
on a Turing Machine; the functions involved have not arisen: as 
Boolean functions in their own right. Moreover, since the NP-complete 
problems are not yet provably difficult for a Turing Machine, these 
techniques do not apply to them. 
Direct arguments for particular functions have not yielded much 

in the way of lower bounds. For single-output functions, no lower © 
bounds on circuit complexity larger than linear in the number of 
inputs have yet been pioiats | 

’ The order of these bounds has not been improved even when the 
allowable gates in the circuit are highly restricted. For the 
complete basis of all binary Boolean operations, the largest of 


these lower bounds is 2.5n recently discovered by . Paul [1977] 


-13- 
and Stockmeyer [1977]. TFor restricted bases, the largest known lower — 
bound is 7n for particular functions [Redkin 1973]. Work contained in 
this thesis establishes lower bounds on the monotone circuit complexity 
of a particular threshold function which is equal to the largest lower 
bound achieved for this basis (namely 3n). 

For multi-output functions, larger lower bounds have been achieved 
on the circuit comp lexi ty in certain bases by exploiting the interplay 
between the functions computed by different outputs. For example, 
_ Paterson [1975] and Mehlhorn and Galil [1976], indepently extending earlier 
work of Pratt [1974], have shown that. the monotone circuit complexity of 
the multiplication of two nxn matrices is ane-n?. Several other 
researchers have used different techniques to obtain lower bounds propor- 
tional to n log n on functions related to sorting problems [Lamagna and 
Savage 1974, Lamagna 1975]. 


' We observe that in both cases the bounds obtained are far short of 


the exponential lower bounds theoretically possiblé for most functions. 


t Recently, Lipton and Tarjon [1977] exhibited larger bounds for 
planar circuits, and Schnorr [1978b] has. announced a 3n lower bound for 
arbitrary circuits. 

at Recently, these techniques have been extended to yield an a(n2/1og-n) 
lower bound on the monotone circuit complexity of certain functions 


[Wegener 1978] 
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For formulas the results are also limited. If arbitrary con- 
nectives are allowed in the formula, the Targest lower bounds are 
given by a general technique of Nefiporuk [1966] which he.uses to 
establish lower bounds proportional to. n2/log n for a specific 
function. Harper and Savage use NeCiporuk's method to establish 
lower bounds on the complexity of the marriage problem [1972]. | 

This technique is used later in this thesis to exhibit a monotone 
function for which negations don't allow a reduction in the formula 
size by more than a constant factor. Other lower bounds for | 
formulas in which arbitrary binary connectives are allowed have 
been reported by several authors [Hansel 1964, Hodes and Specker 
1968, Vilfan 1972, and Fischer, Meyer and Paterson 1975). For. more 
restricted sets of connectives, Khrapchenko [1971] established 
lower bounds on the {A,v,7} formula complexity for the parity 
function proportional to n’, and this is the largest polynomial 
bound yet established for formula complexity. 

The principal research contained in this thesis is the estab- 
lishment of lower bounds on the complexity of specific Boolean 
functions, but includes more general work on asymptotic. bounds on 
the complexity classes of Boolean functions and on fie pelation: 
ship between several complexity measures. In particular, we begin 
by studying the monotone circuit complexity of quadratic monotone 


Boolean functions. (Those for which each prime implicant is a 
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product of two distinct variables.) Using simple counting techniques 
we show that most quadratic functions require at least 2(n*/log n)* 
gates to compute in any circuit. MO the other hand, we ‘show that 
a monotone circuit of size proportional to n 2 Hog n exists for 
every quadratic function. When counting individual types of gates 
in a monotone circuit, we show that n-i a«a-gates suffice to compute 
any quadratic function, and exhibit a specific function for which 
2n/3 a-gates are necessary in any monotone circuit. | 
In chapter 4 we examine the question of whether it may be 
easier to compute combinations of several functions than to compute 
the individual functions and then combine them. Both monotone 
and general circuits and formulas are considered. As a corollary 
to the work reported there, we establish the existence of a function 
of polynomial complexity which has’ smaller circuit complexity than 
formula complexity -- in fact theie ratio is proportional to 
n/log log n. 
Chapter 5 contains an examination of the monotone circuit 
complexity of the threshold functions. Using techniques ginitlar 
to Paul [1977], Stockmeyer [1977], and Schnorr [1974], we establish 
larger lower bounds on their complexity than previously known. 
These bounds are in some cases quite small in comparison with the 
best known upper bounds. tin particular: for the function 


"thresholid n/2" of n_ variables, we establish a lower bound of 


: For notation, see page 51. 
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' 3n~7 gates necessary in any monotone circuit; ton the other hand, the 
smallest known circuit has a number of gates proportional to n(log n)?. 


For the function "threshold 2" of n variables, we present exact 
bourids on the number of a-gates and v-gates for any monotone 
circuit. These bounds were observed by F.F. Yao't _We extend 
these bounds in the case of a circuit with the minimal number of 
a-gates for values of n which are a power of 2, and show that for 
these values of n it is Ampos S10 1e to simu] taneously achieve the 
minimum number of both types of gates. The latter result was again 
previously observed by F.F. Yao. _ | 

Finally, we study the effect of different bases on the -formula 
complexity of monotone functions. In work done jointly with M. 
Paterson, we exhibit a function with monotone formula ounce? 
proportional to n 2 Hog n for which any formula in an arbitrary 
basis can be no more than a constant factor smaller. On the other 
hand, in work done jointly with A. Meyer, we show that there are 
monotone Boolean functions for which the smallest monotone formula 
is larger by a factar of @(n) than a formula for the function in 


the basis of all binary operators. 


z Recently these techniques have been extended by the author to yield a 3n-0(1) 
lower bound on the -{A, v, —} complexity of this function. 


tt Personal communication, 1975. 
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The final part of this thesis deals with the problems of . 
determining the transitive closure of a Boolean matrix and deter- 
mining the shortest distances between all points of a non-negatively 
weighted graph. P.M. Spira [1973] has published an algorithm to 
find the shortest distance matrix which he claims has an average 


running time (over a large class of weighted directed graphs) of 


O(n? log" n). Research by A. Meyer, M.J. Fischer, and this author 
has demonstrated classes for which his algorithm has slow average 
running time. A revision of his algorithm does indeed have 


2 n) average time, over even wider classes of graphs than 


0(n2 log 
Spira claimed. This result is presented in Chapter 7; it is also 
shown there that a further revision yields a simple O(n? log n) 
average time algorithm to compute the Boolean transitive closure 
over a wide class of probability distributions on matrices. | 


We begin by presenting some introductory remarks on Boolean 


functions. 
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CHAPTER 2 


Definitions and Preliminary Results 


Section A. Boolean Functions 


An n-input, m-output Boolean function is a function with 
domain 10,13" and range 0,13" . We will denote the set of all 
n-input, m-ouvtput Boolean functions by Bh om We will generally be 
interested in one-output Boolean functions and denote the set Bh, 1 of 
all such functions with n inputs as BAe If FeBy im? we usually 
denote an arbitrary element of the domain of f by k= (Xp sXooee oXq)s 
and refer to each x, for 1< isn asa variable of f. Some 


i 
specific Boolean functions we will refer to are the unary function 


negation or NOT (denoted — ), and the binary functions disjunction 
or OR (v), conjunction or AND (a or +) * EXCLUSIVE OR or mod 2 sum 
(@ ), equivalence (=); and the constant functions 0 and 1. 


The sets of functions BL have been extensively studied. from 


m 
a mathematical point of view [Harrison 1965, MacLane and Birkhoff 
1967]. We assume that the reader is familiar with elementary 
properties of these sets. We use notation which is standard and 


refer the reader to Harrison [1965] for elaboration and proofs of 


Frequently we will denote the conjunction of variables x and y 


by their concatenation xy. 
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results that we only state’. 

“Boolean functions may be combined in standard fashion. In 
particular the composition of functions is defined as usual. If 
f and g are members of B, for some n, we-define  fvgeB, by 
(fvg)(x) = £(X)vg(x) for any Ye{0,1}". If a collection of functions 
(f;| jel) is each a member of B_, the disjunction VifjeB, is 
defined. in the obyious fashion. The conjunction. of: functions is 
defined similarly. In addition, we.may define, from the functions 


feB and geB 


Ne Ms: the function fxgeB by 
ro 


Nom,” ny tn, im +m, 


Mae Dae See A = ie aaa GY] o--- Yq) 


om  - mM 
(where we use the obvious isomorphism between {0,1} 10,1} . and 


{0,1} ). The combination v(fxg) is defined, for FeB 
Se 1 


+ Examination of the properties of Boolean functions indicates 
strong similarities among them. In fact, they may be grouped into 
pairs in which occurrences of a and v are interchanged, as well as 
occurrences of 0 and 1. Such statements are called ‘duals of each 
other. It is a well known fact that a Boolean statement is true if 
and only if its dual is true. This is known as the pricipal of 
duality, and permits some economy in proving properties of Boolean 


functions. 


Sie 
and geB » to be the obvious composition of v and x ; that is 
2 | 
v(fxg) (x75. oe ny Sy gees Ing) = F(x, geese “ny valy, gees Vn): 
A(fxg) is defined similarly. 
Two properties of these combinations which we will use are 


Stated in the following lemma: 


2.1 Lemma: Suppose f and g are members of B, for some n2 0. 


Then (a) if fvg = 0, the constant function 0, then f = 0 and g =U 


and (b) if fag=1, then f=] and g=1 


One specific set of Boolean functions is the set of projection 
functions. The jth projection function of n_ variables, 
denoted Wy is defined by Ty (xq 9+ + »X_) =X, for each i in 
{A2iecesht, If f is an arbitrary member of Ba in? we denote 
by f; the function 1; of, the jth component, of f. 

We say that a function FeBo im functionally depends on its 


yuh variable x. if there are constants ¢ and d in 10,1}" 


i 
which differ only in their i” positions for which f(%) 4 f(a). 
Finally, if A is some set of input variables for f, and 
c,e{0,1} is a constant defined for each xeA, then we denote the 
restricted function of the remaining variables which is obtained 


from f by setting each variable x in A _ to cy by 


f | ee), for xeA ° 


We are particularly interested in one subclass of Boolean 


functions, the monotone Boolean functions (m.b.f.'s). Observe that 
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the set {0,1}" is partially ordered by the rule 


(xX, 2+ ++ X,) < (yy o+++s¥,) iff X s y; for ee We2yciean 
(where 0 < 1). a 


The set B, is partially ordered (in fact is a Boolean algebra) 


by defining, for FsgeB, on? 
f<sg iff f(x) < g(x) for all %et0,1}", 


A Boolean function feB""” is monotonic increasing, or 
monotone for short, ff and only if it preserves the partial | 
ordering <; that is, if tx) s (¥) for every pair of inputs x yel0, es 
for which x < y: 
Specific examples of m.b.f.'s include the function 


threshold k of n variables, denoted Tj’, which is defined by 


Ty ly oe++ Xp) = 1 if and only if at least k of the variables 
(x, ..-+5x%,) have the value 1. The threshold 1 function is. the 
disjunction XX oe VX. the threshold n function is the 
conjunction X{AXgA eines Another m.b.f. is the Boolean 
multiplication of two nxn matrices A and B a ee in 


a 2,2 which is defined by As B= ¢ where ney, pare ; 1 Aik 83) 


for each i and j in {1,2,...,n}. A third set of m.b.f.'s 


is Boolean convolution, in which we cerine +, s2n-] by 
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= ,V . 
FL K yore Xo] gre ya) jrgek GY) 


for each 0 < k < 2n-2. 
The m.b.f.'s satisfy several’ properties which are not true of 
Boolean functions in general. The following facts are easily 


verified. 


2.2 Lemma: Suppose f and gcB are monotone Boolean functions. 


Then 
(a) if fag = 0, then f=0 or g=0. 


and (b) if fvg=1, then f=1 or g=l. 


— | —_ 


Further properties of the m.b.f.'s will be explored later in 
this chapter. It is useful at this point to point cut one 


property of the ordering relationship on Bo 


2.3 Lemma: Suppose that f, g, and h are functions in Bo. If 


f<h and g<h, then 


(1) fan =f 
(2) fvh =h 
and (3) fags fs fvgsh.? 


t We remark that the dual of ai arbitrary function feB, is the function D(f) in 


). The truth of a statement 


B, defined by D(f)(X) = “F(—xy» —Xyse- eo X 
(cont. on next page) 


nH 
i 
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section B. Representations of Boolean Functions 


There are many different ways of representing Boolean functions . 
some of which give rise to the complexity measures: discussed in this 
thesis. One standard ‘representation is the truth table of the value 
of the function on different inputs. The int éying representation 
used in this thesis is that of a combinational (or gate-type) 
switching circuit, and is identical with those studied by funeyegs 
authors [Harrison 1965, Savage 1976, Paul 1977, Schnorr 1974 and 
others]. The underlying structure is a graph,-which we now define. 

A directed graph D consists of a pair of finite sets (V,E), 
where V is a set whose members are caTled nodes or vertices, and 
Ec VxV is a oct whose members are called e edges or arcs. arcs. Another 
notation for V and E is NODES(D) and EDGES(D) respectively. 

If (vsw EDGES(D) we say there is an edge from v to w in De 
A path or chain of length k in D is a sequence of nodes 
Voo¥, 9+ ++ o¥) (for k.2 0) of D such that there is an edge from 


V5 to Vie] for every ic{0,1,...,k-1}. A graph D is acyclic 
if there is no path from any node A of D to itself other than 


like Lemma 2.3 is unchanged if every function is replaced by its 
dual. . 
Observe that if f<g, then D(f) 2 D(g), and that the dual 


of a monotone function is monotone. 
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the trivial path of length 0. For any:node. A of an acyclic graph 


D, we define the following subsets of nodes of .D which pertain to 
A: 
Succ(A,D) = {BeNODES(D) | there is an edge from A to B} is 


the set of immediate successors of A; 
Succ (A,D) = {BeNODES(D) | there is a path from A to B in OD}, 


Succ’ (A,D) = {BeNODES(D) | there jis an path of length k 21 
from A to B}, 


Pred(A,D) = {BeNODES(D) | there is an edge. from B to A} is 


the set of immediate predecessors of A, 
Pred’(A,D) = {BeNODES(D) | there is a path from B to A in D}, 


and Pred*(A,D) = {BeNODES(D) | there is a path of length k > 1 
from B to A}. 


If W is some subset of nodes of D, we may extend the above 
devinitions to Wand speak, for example, of the set of immediate 


successors of W , defined by Succ(W,D) = USucc(A,D). Finally, 
AcW 


the indegree or fan~in (respectively outdegree or fan-out) of a 
node A is the number of edges directed into (out of) A in D 
and is denoted indeg(A,D) (outdeg(A,D)). In general’ we will omit 
mention of the graph D in the above notation when the graph is 


clear from context. 
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Suppose 2 is a finite set of primitive functions - g28, 
i 


called the basis. A combinational circuit or network over the 


basis ©, in short an g-circuit, is a directed acyclic graph N 


together with a labelling of the nodes and edges of N. which is 


subject to the following constraints and definitions: 


(1) For some positive integer'n, at most n of the nodes 


(2) 


(3) 


and (4) 


with indegree zero are given distinct labels from the 


set {Ky sXoeeeeaXats we refer to such nodes as input nodes 


~ and denote their set by INPUTS(N). 


There may be one node of indegree zero labelled with the 
symbol ZERO, and there may be one node of indegree zero 
labelled with the symbol ONE; such nodes are referred to 
as constant nodes. . | 

There are no other nodes of indegree zero other than those 


mentioned in (1) or (2). 


Each node with indegree one or more is. labelled with some 
member g, of 2. Each such node is a gate node, and 
their set is denoted GATES(N). Such a node G_ labelled 
with g, must have indegree in N equal to the arity 


i 
be labelled with one of the integers {1,2,....n} in 


n, of G53 furthermore, each edge directed into G must 


such a way that every edge into G gets a different label. 
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An example of such a network is given in Figure 2.1, where the 
basis is the set {a,v,;7 }. (For figures, the ares. in the graph are 
always directed downward. Arbitrary nodes are designated by triangles, 


constants or inputs by rectangles, and gates by circles). ~ 


Fig. 2.1 A Combinational Circuit 
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If the input nodes to an g-circuit N are labelled with members 
of the set {X, pees aX bs we may associate a Boolean function in B., : 
with every node A in N_ by the following inductive rule. We denote 
the associated function by -Res(A;N). If: A. isan input node 
labelled with the variable Xa then Res(A,N) is the projection on 


h 


the it coordinate, i.e. the function ‘1. If A is the constant 


node ZERQ (respectively ONE), then’. RES(A,N) = 0, the constant 
function in B, equal to 0 on all inputs (respectively RES(A,N) es 1, 
the constant function in B, equal to Th). Finally; suppose A is- | 
a gate for which every node in Pred(A) has an sasociated functions -. 
If we let B, be the node in Pred(A} such“that the edge from B, 
to A is labelled with k, and if A ig labelled with the basis 


element g,€2, then 
Res(A,N) = 9, (Res (B, .N) ,Res (Bz ,N),...ReS(By, »N)). 


We will say that an a-circuit N comp ss a function feBy iy 
iff for each i, 1 < ism, there is a node A, of | N Such that 
Res (A. ,N) =f,> the ith component of f.. For example, the 
circuit given in Figure 2.1 computes the function f(x, Xo Xa) = 
% & Xo a X3 since the gate with outdegree 0 computes this function. 
We may also say that the ant computes the function . 
x, 2Xo 2X3) = (x) > XA Xo) since there are gates which compute 


each component of g,. A node which computes a component of the 


designated function will be called an output node. 
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A basis 2 is called semi-compiete iff given any function FeB, 


for arbitrary n, there is an Q-circuit as described above which 
computes f. The basis is called complete if any f can be computed 
by an = &- circuit which does not contain the nodes ZERO and 
ONE (i.e. constants are not available). Three complete bases 
in which we will be interested are Bos the set of all binary 
Boolean functions, B. = Byu Bos and U = {av 5}. An. example 
of a semi-complete basis is the set {@,a} ~ An exhaustive 
characterization of all complete bases has been made by Post [1941], 
and the interested reader is referred to this work. 

One ~ jncomplete basis which holds special interest is the 


set M = {a,v} which is important for the following reason. 


2.4 Theorem : A Boolean function feB, i is monotone iff there 


is an M-circuit which computes f. 


For a proof of Theorem 2.4 the reader is referred to [Harrison 
1965 p. 189]. 

The bulk of this thesis is concerned with the results about 
M-circuits (also called monotone circuits) computing monotone 
Boolean functions. 

One special class of circuits which holds special interest 
is the set of Boolean formulas. An g-formula is an g-circuit 
in which each node in the circuit has outdegree at most one, and 


which has a unique node of fan-out zero corresponding to the unique 
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output node. For a formula, we remove the restriction on the 
circuit that there be only a single node for each input variable 
and constant, and allow any number of nodes of indegree zero 
labelled with any input variable or constant, each node having 
outdegree one. For an example see Figure 2.2. The graph of any 
a-formula is a tree in which each leaf node (one with indegree 


zero) is tabelled with a variable or constant, and each interior 


Fig. 2.2 The Formula (x, v Xo) A Ox Vv X3) 


node (every other node) is labelled with a gate. This formulation 
is easily seen to be equivalent to the usual inductive definition 
of an Q2-formula. Clearly a formula can compute only a single-output 


function. 
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Classically, Boolean formulas’ have been extensively studied 
as a means of representing Boolean: functions. Many. formula schemes 
have been proposed as canonical means for their representation, 
including the classical disjunctive normal form. In this method, 


a function FeB, is represented as the formula 


c- -¢ oe 
V % "54 ara n 
(cy. oo -2C,)eAg 
where x) = eae EN 
ax if j=0 


for any variable x, and Ag is those set of constants %e(0,1}" 
for which f(¢) = 1. Other canonical forms are discussed in 
[Savage 1976]. | i. 
If F is a formula, and H. jis a node in F, then Fy will 
denote the subformla of F above. H; that is, Pred’ (H,F). 
In this thesis, we will occasionally replace part of one 
circuit by another circuit and speak of combinations of circuits. 


For example, suppose that N and N’ are g-circuits which contain 


nodes A and A’ respectively. The circuit obtained by replacing 
A by A* is constructed by first identifying the corresponding 
input and constant nodes of N and N° and considering them as 
one network. Node A together with each arc directed into it is 


removed from the graph, and each edge (A,B) originally in circuit 
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N is replaced by the edge (A°,B) (with the same label as previously). 


For example, see Figure 2.3. The replacement of a node in a circuit N 
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by a node in the same circuit is defined similarly, but substitution 
~ must not -be made in a manner which introduces a cycle into the 
graph. 

One may say little in general about the functions computed by 
a circuit in which one gate has been replaced by another. Specific 
situations may allow some conclusions to be drawn. For example, 
if node A in N is replaced by a node A’ in N° for which 
Res(A,N) = Res(A’,N~), then it is easy to prove by. induction on the 
length of paths that every remaining node in N- computes the 
same function as it did before the substitution. 

One special replacement is the substitution of a constant © 
for a variable. We say that N° is the circuit obtained from N 
by setting variable x, to 0 (respectively 1) if N° is obtained 
by replacing variable node x; in-N by the constant node ZERO 
(respectively ONE). One inductively defines setting a collection 
of variables to a constant. It is Stati easy to verify the . 


following fact. 


2.5 Lemma: Suppose N is an g-circuit, and x,<INPUTS(N) . If 
N* is.the circuit obtained by setting variable Xs to 0 , then 


for each remaining node A in N’ , 


Res(A,N~) = Res(A,N)| 2% : 
i 
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A similar statement may be made if X is set to 1. 
Observe that, depending on. the basis, if agate ina circuit | 
has one input which is constant, then some simplification of the 


circuit may be made. For example, suppose the basis is the set Bo 


of all binary Boolean functions. If a gaté has a constant input 
then, except for trivial cases, it may be eliminated from the 
circuit since the output of that function-is then a unary function 
of the other predecessor. Since this unary function may be 
absorbed into a preceding or succeding binary gate (if one such 
exists), then one may obtain a smaller circuit equivalent to the 


original. See Figure 2.4 for an example. Such simplifications may 


Fig. 2.4 Some Simplifications 
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always be made in the basis M. This elimination of dates will be 
a principal tool by which icwer bounds: on circuit complexity are 
obtained. | | 

One additional, representation for a Boolean function is as a 
set of prime implicants. In the interest of brevity we restrict 
ourselves to monotone Boolean functions, although a similar devel- 
opmen may be made for all Boolean functions. oe | 

A monome or product is a product of distinct variables, e.g. 
X7XoXs is a monome. In particular, we denote the empty monome 
by & A monome m_ includes a monome m* if every variable 
which appears in m~* also appears in m. We denote the function 
defined by a monome m by T(m), and if M is a set of monomes, 


then T(M) will denote the function V T(m). T(e) is the . 
MeM . 


constant function 1. 

Suppose that f is a monotone Boolean function. A monome m 
is an implicant of f iff T(m) s f3;m is said to be a prime 
implicant of f if m includes no other implicant of f. We 
denote the set of all prime implicants of f by PI(f)*, 

‘For example, suppose F(x) Xo sXq) is the threshold 2 function 
of three variables, namely ie ; ‘Then XyXXq_«is 


an implicant of f, but fs not a prime implicant since the monome 


t ; | 
One defines V fim) = 0. Hence PI(Q) = @, the empty set, and PI(1) = {e}. 
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XX «is also an implicant of f. PI(f) is the set {x,%5». XpXq° KX) 
The importance of the set of prime implicants is that they are 
a canonical representation for the m.b.f.'s, and are fairly @asy 
to manipulate mathematically. A straightforward proof (see 
[Harrison 1965 p. 190] for example) demonstrates ‘the fol lowing fact 


of their canonicity. 


2.6 Lemma (Quine): Suppose feB. isa monotone Boolean function. 

Then f = V T(m). Moreover, if R is any set of monomes such 
mePI{ fF}. - 

that no monome in P includes another monome in FP, and 


f=T(P), then P = PI(f). 


The set of prime implicants of a combination of functions can 
be obtained from those of the constituent functions by means of the 
following simple set of rules. Suppose that f and q are m.b.f.'s 
in Ba Then for any monome m, m 4s an implicant of the function 


fvg tf and only if it is an implicant of f or an implicant of g. 


T Note that we equate the monomes XX and XpXy- Actually, the 
set of all monomes is equivalent to the free monoid on: the elements 
{X) sXooeee Xp } modulo the relations of the comoutativity (xy = yx 
for all variables x,y) and idempotency (xx.= x. for all-x). 


€ is: the identity element for this algebraic system. 
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Moreover, PI(fvg) is obtained from the union PI{f)uPI(g) by: 


eliminating all monomes which include another monome in the union. 
For example, if PI(fp) = {Xq XpXq 0X4} and PI(g0) = {X1 9XqoXgXq}> 
then PI(fp¥go) = {Xq sXooX4}- If f and g arem.b.f.'s in 

Bh? and m is a monome such that mePI(fvg), then mePI(f) or 
mePI(g), but the converse is not in general true. By Lemma 2.1(a), 
if PI(fvg) = @, then both PI(f) and PI(g) are empty sets. 

The set of prime implicants of the function fag “tay be 
obtained by forming the set of all products m-n of a-monome 
mePI(f) and amonome nePI(g) (where duplicate variables in n 
and m are reduced by the rule x*x = x), and then eliminating 
any products which include other products in the set. For fy 
and gy of the previous example, PI(fyaSp) = {Ky oXpXq oXqXq oXgXq)- 
Again, if tePI(fag), then there are monomes mePI(f) and nePI(g) 
such that men = t, but the converse is not in general true. 

One may relate the ordering relation < on the m.b.f.'s with 
their canonical representation using prime implicants by the following 


general result. 


2.7 Lemma: Suppose f and geB, are m.b.f.'s. Then f<g iff 


every monome mePI(f) includes some monome  nePI(g). 


Proof: Suppose f<g, and mePI(f). By Lemma 2.6 and 


Lemma 2.3(c), T(m) s V T(n) = f s g, so by the transitivity of <, 
nePI(f) 
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T(m) sg. Hence by definition m is an implicant of g. Define 


My to be an implicant of _g which is included in’ m and which 
includes no other implicant of g. Then My. isa prime: implicant 
of g which is included in. m.- . 

Conversely, suppose every monome in PI(f) includes some 
monome in PI(g), and let Ce{0,1}" be such that f(¢)= 1. By 
Lemma 2.6, there is some monome _m «PI(f) -such that. T(m 2) = 1. 
By hypothesis, there is a monome nin PI(g) included in m ; 
By Lemma 2.3(c), we know that T(m).< T(n). Hence T(n )(@) = 1, 


and since n is an implicant of g, g(t) = 1 as well. 


Lemma 2.7. 


One may define classes of Boolean’ functions in terms of 
structural properties of their set of prime implicants. For example, 
we will say that a m.b.f. feB, is quadratic if every monome in 
PI(f) consists of the product of exactly two distinct variables; 
am.b.f. FB, 1 quadratic iff every component f; is 
quadratic (for 1< ism). 

The set of prime implicants of a function has been extensively | 
used as a tool in explaining arguments in this field [Paterson 1975 
Mehlhorn and Galil 1976, Lamagna 1975]. ‘This representation ‘is 
particularly helpful in describing several replacements which may be 
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Galil [1976] of which special cases have been used by other authors ; 
we will find use for one of them later in the thesis. If N is 
any M-circuit, and A is anode in N, then we will denote 
PI(Res(A,N)) by PI(A,N), and as usual] omit mention of N whenever 


the circuit is obvious. 


2.8 Lemma (Mehlhorn and Galil: Suppose N jis an M-circuit, A is 


any node in N, and PI(A) = {MyM >... 9m If there is no 


by 
k 
output node B of N with My*nePI(B) “for some monome n, then 
one may replace A_ by a gate which computes the function 

T(m, sm se ee omy) without changing the function computed at any 


output gate. 


For a proof of this result and several other general replace- 
ments for M-circuits the reader is referred to the paper [Mehlhorn 
and Galil 1976]. 

One final topic which pertains to circuits in general is the 
different notati* -s of dependence. We will say that'a node A in 
a circuit N depends functionally on the variable X, if 
Res(A,N) depends: functionally on Xie A different notion of 
structural (path or syntactic) dependence:holds.if there is a path 
from input node x, to A in N, i.e. if AeSuce’ (x; 5N). 

Clearly if A. depends functionally on Xs then it must also 


depend structurally on Kes but in general the two notions do not 
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coincide. It is not known whether they coincide for minimal sized 
circuits over particular bases (M for example}. However, it is 


possible to prove the following useful observation. 


2.9 Lemma: Suppose a function feB, depends funct onal ly on a 
variable Xa and N is an Q-circuit computing f with 

Res(U,N) = f for a node. U in N. Then there is. a path from 

x to U in N such that every gate in the path depends function- 
ally on x,. 


i 
Proof: We construct the chain backwards from the output gate , 
U. Let U= Up: Inductively assume that we have constructed a | 
path Ge aiecaseccitly in N such that every node in the path 
depends on x If U = Xe then the proof is complete. Other-. 
wise, UL must be a gate since no other input or constant node 
depends on Xx;. Hence, since Res(U,) _, depends.on Xgs there 
must be a node BePred(U, ) which depends functionally on. Xa 

We extend the path by defining Urey = B.. (See Figure 2:.5) 


Since N contains no cycles, the.nedes selected are a]l . 
distinct. Since: there are only a finite number of nodes in N; 
the above process must eventually halt. : 


O Lemma 2.9 
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Fig. 2.5 The Extension 


ree 


2.10 Corollarv: If a function feB,» which depends on Ke is not 
the projection function Ts and if N is an g-circuit which computes 


f, then outdeg(x, .N) ats 
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Section C. Complexity Measures ; 

The models of computing Boolean functions introduced in the 
previous section naturally give rise to various complexity measures. 
If a is an arbitrary basis and N is an a-circuit, then we 
define C.(N) to be the number, of gate-type nodes in N. If 
feB, > then we define the conbijattonal (or circuit) complexity of 
f with respect to 2, denoted G (f), to be the minimum value of 
¢ Q(X) for all a- ~circuits N whch compute f. If f is not 
definable by any 9- circuit, ‘we defines Cc olf) =o, 

For formulas, one possible complexity measure is to consider 
the number of gates as was done abeve for circuits. We instead 
use the number of occurrences of literals as our complexity measure. 
If F is an 2-formula, we define i, (F) to be the number of occur- 
rences of variable input fodes-t#@the circuit associated with F 
(the number of leaves of the. tree corresponding to F. which are 
labelled with variables). Observe that if a- ds a binary basis, 
and F contains no constant nodes, then t(#) js one more than 
the number of gates in F. The formula complexity of a function 
feB, with respect to , denoted 1,(f), is the minimum of the 
values of: Lo(F) | for all a-formulas: F which define f. Again, 
if f is not definable by an 9-formula, we define L(f) > o. A 
minimal Q-formula (respectively a-circuit) for f is an 2-formula 


-F (respectively 2-circuit N) which computes f such that 
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LCF) = Lo?) (respectively -C,(N)..= Co(f)). ‘Where there is no 
ambiguity: we will use the same notations of complexity Ly for 
formulas and C, for circuits as well as for functions. 

For the particular monotone basis. M, we will: use the alternate 
notations MC and ML for Cy and Ly respectively. For this 
basis, we also define a complexity measure which depends on the 
types of gates used in the circuit or formula. If N= is an M- 
circuit, define MC (N) and MC.(N) to be the number of v-gates 
and A-gates respectively in N. If f is am.b.f., we define 
MC (f) (respectively ML (F)) to be the minimum value of MC, (N) 
as N ranges over all M-circuits (respectively formulas) which 
compute f (where here we consider a formula as a special type of 
circuit). We similarly define MC (f) and ML, (f). An v-minimal 
(respectively A-minimal) M-circuit for a m.b.f. +t is one which 
has the minimal number MC,(f) of v-gates (respectively 
MC (f) of a-gates). 

The above are the complexity measures considered in the thesis, 
but other measures can be defined. For example, the depth of a 
circuit can be defined to be the length of the longest path in the 
circuit. This measure has been studied by Spira [1971] and more 
recently has been studied by several authors [McColl 1977, 

McColl and Paterson 1977, Paterson and Valiant 1976, and | 
Borodin 1977]. One may sintiarly define the breadth of a circuit; 
this measure has recently been. studied by Schnorr [1976b]. Finally, 


one may choose to consider only circuits in which the outdegree of 
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of each gate in the circuit is bounded above by ‘some integer k. 
Johnson, Savage, and Welch establishéd ‘the’resutt ‘that if-k22, 
such a complexity measure is proportfonat’ to C, for any complete 
basis 2, (for proof, see [Savage 1976 ‘p.24]). . 

The. relationship between these complexity measures of an 
individual function is welt-understovd’ in some’ ¢ircunstances and 
unknown in others. Muller [1956] pointed*out that°in general, the 
circuit complexity of a funetion in’ any- complete Basis ag is pro- . 
portional to its Cs, complexity. -This i§'true-since any gate in 
B, may»be synthesized by an a-subeircuit and vice versa, so one 
can replace a gate by a subcircuit with only a constant: factor’ 
increase in the total number of gates’. For formulas, the choice 
of complete basis can make a difference in formula size. For’ the 
parity function - Fg(xz 20+ -oX,) = AyBXGD. » OX, > — Khrapchenko {1971] 
demonstrated that 1,(fg) 4s proportiotial to “n", while fy 
clearly has a formula of size n in the basis “By. Pratt [1975] 
has recently shown that the maxtmal-gap' between the complexity in 
any patr of binary ip guest cannot be much Targer by demon-— 


strating that Ly(f) <. [L, off) 109310. for any: FB, 


‘The relationship cs the By-complexity of a monotone 


function and its complexity in the incomplete basis MW As net as 


Tin fact, C3 (f) = Ge (f) for any / function fB, » and if. f depends on 
2 

more than one variable, then Ca (Ff) is the minimal number of binary. 

gates in any B- circuit which ee f. See (Savage 1976 p. 39] 


for a Reet of this fact. 
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well understood. Specific results wil] be shown in Chapter 6. 

Results about the maximum values of these complexity measures 
among all Boolean functions were mentioned in the introduction. Jn 
brief, the number of distinct circuits with at most a fixed mumber 
k of gates is bounded above by ck k for some constants c and 
c’ depending on the basis.. From this fact, one can easily show 
that if there are many different functions to compute, then there 
are not enough small circuits to compute them all, so "most" of 
them require a large number of gates. What is remarkable is that 
soem clever constructions exist which enable one to compute .al]- 

Boolean functions in certain classes with circuits. or formulas 
which are close to, or even asymptotic with, the lower bounds on 
size obtained by the above argument. 

For the class of all Boolean functions, Shannon [1949] and 
Lupanov [1958] have shown that the maximal Bo circuit complexity 
of any Boolean function eB, is asymptotically 2"n. Similarly, 
it is known [Krichevskii 1961, Lupanov 1962] that the maximal B. | 
formula complexity is proportional to 2" og n. 

Recent attention has been focused on the class of monotone 
Boolean. functions. Kleitman and Markowsky [1975], extending work 
of numerous authors, have shown that there are asymptotically 


(pia) oe 
2 h/2 m.b.f.'s of n variables, and one obtains a correspond- 


3/2 


ing lower bound of order 2"/n on the maximal Bo (and. M) 
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circuit complexity of m.b.f.'s. Pippenger [1976] has shown that 
this lower bound is attainable in showing that ‘any m.b.f. feB, 
can be computed in a Bo-circuit. with asympteticalty of2"n3/2 ) 
gates and an M-circuit with asymptoticalty 0(2" 109 n/n?/? nates. 

Lower bounds on the complexity of specific functions dominate 
the bulk of this thesis. It will be helpful to point ‘out’ some 
facts about minimal circuits. If FeB, in and 2° are arbitrary, 
then any minimal Q-circuit for f has at: sost “m-gates of outdegree 
0. This is so since any gate of outdegree 0 which is not an output 
gate may. be eliminated from the circuit without changing the 
functions computed. at the output nodes. 

In a similar fashion, one can show that no two gates in a 
minimal a-circuit compute the same-function. Suppose G and G° 
were distinct gates in a minimal g-circuit N for which 
Res(G,N). = Res(G*,N). Since. Nis acyctic, either GfSucc (6°) 
or. G-éSucc (G); w.1.0.g. we assume the former. Then we eliminate 
gate G, and replace G by G” in . Pred(H,N) for each gate 

H eSucc(G,N). It is easily shown that the resulting circuit is 
acyclic, and that every renaining gate-4n the new -circuit computes 
the same function as jit did originally. This is a* contradiction 
since N was a minimal Q-circuit. a | 

Suppose that a function feB, depends on at least two variables. 
Then shy Q-circuit or formula for f muSt contain-at least one 


binary gate (where 2 is Bo» B, U, or M). By absorbing any 
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unary gates into the binary gates, one can show.that in any minimal 


g-circuit. for f there are no edges directed out. from any constant 
nodes since each gate with one input constant..can be.simplified: 
and eliminated. Similarly, no node G has two edges directed into 
the same node H, so outdeg(G) is equal tothe nymber of nodes in 
Succ(G). | 

For minimal formulas, we point out that the.notions of 
structural and functional dependence on a variable Xs coincide, 
’ To prove this, suppose that ® jis an arbitrary basis, and F is 
a minimal °-formula for ‘fB,. If H is a gate in F which 
does not depend functionally on the variate ces then we claim 
that there are no occurrences of the variable Xe in Fue If 
there were, then one could replace all occurrences of x in Fy, 
by the constant 0 (or 1) without changing the function. computed’ 
at H and hence by the entire formula F. Since this reduces the 


number of occurrences of inputs in F, F, must not contain any 


H 
occurrences of X;- 


Finally, we point out one simple lower bound on the complexity 


of most functions. 


2.11 Lemma (Savage): Suppose a function feB. depends on each of 


its variables. Then Cy (f) = n-1. 
2 


Proof: Suppose N is a minimal Bo-circuit for f which 
has k gates. Observe that since each input node has outdegree 


at least one and since there is at most one gaté of outdegree 0, 


we have 
. } _ indegree (A,N) = 2k 
AeNodes(N) = 
and a) outdegree(A,N)-2.ntk=1:. 
AcNodes(N) 
Hence, since 
) _ indegree(A,N) = ine outdegree(A,N) ; 
AeNodes (N) eantesia) get ee 


we obtain 2k 2 ntk-] which establishes the:,]emma. 


O Lemma 2.11 


Section D. Circuits and Turing Machines 


One question which naturally arises concerns the relationship 


between the measures of complexity defined above and ordinary 
Turing Machine (T.M.) complexity. As. mentioned -above, every finite 
function is "easy" for a Turing Machine to compute since it may be 
computed by finite state machine. When one considers the finite 


subproblems of an infinite problem, however, the situation is 
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different. For concreteness, consider language recognition 
problems,.i.e, functions f:0,1}"> {0,1} (where {0,1}. is. 
the set of all finite-length words over the alphabet {0,1}), and 
let f, be the restriction of f to words of length n. Consider 


the circuit complexity measure Ce ; 
2 


One can show that there are functions f which are computable 
by a deterministic T.M. in space 2" Gieaca relatively easy function) 
for which CBF) is near-maximal for all valuesof n (see 
[Stockmeyer 1974] for proof). On the other hand, one can exhibit 
arbitrarily complex, even arbitrarily non-recursive, functions f 
such that Cp, (Fn) is 1 for every neN. There thus appesrs to 
be little correlation between the T.M. complexity of a function and 
the circuit complexity of its finite pieces. | : 

A strong relationship between the two measures comes from a 
result due to Pippenger and Fischer [1977] which asserts the following: 
Suppose f is computable by a T.M. M operating in time T(n) for all n. 
Then there is a constant c (depending on M's state diagram) such that 
c(f,) s cT(n)logT(n). The proof is obtained by simulating machine M by 
an "oblivious" T.M. M* which operates in time T-logT - the positions 
of the heads of M- depend only on the input length n. Then it is 
actin that any oblivious T.M. can have its operation on an input of 
length n simulated by a number of gates proportional to its running 


time, 
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This remark has many applications [Paul 1976]. As an example, 
Fischer and Meyer [1971] have shown that Strassen's fast integer 
matrix multiplication algorithm can be.used:to multiply two 
Boolean nxn matrices in time 0(n 99271 ogn toglogn Yeglogl ogn) 
on a Turing Machine. Using the oblivious -construction-of a 
circuit simulating the T.M., one thus obtains a Bo-circuit for vatrix 
multiplication with o(n?*82) gates. (Recall that Paterson [1975] afr? 
and MehThorn and Galt1 [1976] have shown that any M-circuit 
for this problem must have a(n?) gates.) 
While Fischer's result ere technical, tool, it is 
obvious by the earlier remarks that the converse result does not . 
hold. Schnorr [1976d] has strengthened the result by considering 
oracle T.M.'s in which one work tape initially holds some string. 
A in {0,1}" . Define, for f:{0,1)" + {0.1}, . : 
TC(f)(n) = ming |M|-TA(n).1og SA(n)_ |: with oracle A computes f} 
where M ranges over all oracle. T.M.'s (see [Schnorr 1976d] for 


A 


| conventions > T and sare the time and space required by M, . 


and |M| is the number of instructions. in program. M. Then there 


exists some fixed polynomial -p and a.constant c such that. 


Cy (fis ceTC(#) and TC(f) <P (Cy (F)). 


Section E. Misceilaneous Notations 


Frequently we will be concerned with the rate of growth of a 


numeric function rather tian its exact value. Suppose f and g 


~Si- 


are functions defined on the natural numbers. Then we write 


f = 0(g) iff there is a constant c > 0 such that lim sup (f(n)/g(n)) < c 
te, 
(i.e. if f(n) sc g(n) for all sufficfently large n), 
f= o(g) iff limit f(n)/g(n) = 0 , >= ; 
ho 


+ 
" 


a(g) iff there is a constant c > 0 such that Vim sup (f(n)/g(n)) +c 
(i.e. if f(n) = g(n) for all sufficiently large n), 
f = 0(g) iff f = 0(g) and f = a(g), | 
f~g iff limit f = ° 
g Vimi (n)/g(n) = 1, 


and 


f<g iff lim sup f(n)/g(m) <1. — 
ho 


If f= 0(g) we say that f is proportional to g, if 
f~g we say that f is asymptotically equal to g, and if 
f sg we say that f is asymptotically less than g.. 


If A is any finite set, then |A| , the cardinality of A, 
is the number of elements in A. A partition of A is a collection 
k 
UB, =A. 
i= k 
If A and B are sets, then A-B = {x | xeA and xéB}. 


of pairwise disjoint sets By »Bo>+-» sBy such that 


If X is a set of variables, an X-variable is a member of X. 

I is the set of natural numbers, and R is the set of real © 
numbers ; R™ = {reR | r2 O}uf@} 

If k < % are natural numbers, the set of consecutive natural 


numbers {k,k+],k+2,...,2-1,2} is denoted [k:2]. 


me 

If x is a real number, then, xj is the greatest integer less 
than or equal to x; [xl is the least integer greater than or eaual 
to x. 

"Iff" is an abbreviation for "if and only if", and "w.1.0.g." 
is an abbreviation for "without loss of generality”. 

d is the empty set. 


! 
If k and g are natural numbers, then (K) = K 


a’ oat (n-g)! * 
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CHAPTER 3 


Worst-Case Values for the Complexity of Quadratic Functions 


We begin by considering several complexity measures for monotone 
Boolean functions, and examine the complexity of quadratic m.b.f.'s 
under these measures. Recall that a monotone function FeB,, 1s 
quadratic iff for each icf :m], component ? of - has a set 
of prime implicants PI(f;) equal to {x5 +X) | {j,k}<eA;}, where 

A. is some collection of subsets of [1:n] of cardinality two; 


equivalently, f,(x,s-+..%,) = The "threshold 


(5h Yea, (x; X,)- 
two" function of n variables 1 is quadratic since 

PI(T, )= {x 5° | Joke[T:n], j # k}. Boolean matrix multiplication 

and Boolean convolution are additional examples. 

In this chapter we primarily consider. the measures MC, MC a? 

‘and MC, for circuits, and corresponding measures ML ML» and | 
ML. for formulas. For any M-circuit N, we have the relationship 
MC (N) + MC (N) = MC(N). For afi M-formula F which does not contain 
any constant inouts, we have the corresponding equation 
ML (F).+ ML (F) = ML(F) - i Hence for an arbitrary monotone 


function feB we have the relationships 


“The number of gates in an M-formula is one less than the number of 


occurrences of variable and constant inputs. 


26a 


MCF) + MC (f) < MCcf) 
and 4 


ML (f) + ML(f) < ML(f) - 1, 


but at least the first inequal ity may be strict for specific 
functions f. For excep lS in Chapter 5 we will show that 
MC,(T5) = {Tog i} and MC Tp) = n-4, ‘but, Will also ‘show that if 
n is a power of 2, any-M-circuit for a which has exactly ~~ 
Tlog n! a-gates must also have at least an + 3Mog fl - 9 
v-gates, and hence any M-circuit for aust have at least one 
more gate than the sum MC allo ) + Me wT) for idea! values ‘of n. 
Two areas in the complexity of © ‘single-output quadratic m. b. f. s 
are presented in this chapter. ‘The asyiptotic value of the largest 
complexity for all quadratic m.b.f.! Ig is explored first. It is | 
shown that every quadratic m. b. f. in B, has an M-circuit 
complexity of 0(n°/log n). That thts “upper bound is within a 
constant factor of the best possible result is shown by proving that 
"most" quadratic m. b. f. 's have circuit complexity over the larger 
basis By of a(n2/log n). . 

_ The other area is the determination of the number of A-gates 
necessary and sufficient to compute’ any quadratic m.b.f. in B,: 
We show that .n-1 s-gates are sufficient in any M-formula, and 
exhibit a quadratic m.b.f. for which (2/309 a-jates are necessary 
in any M-circuit. By restricting the types of M-circuits allowed, 


we can demonstrate a quadratic function for which n - o(n) a-gates 


are necessary in any restricted M-circuit, due to an observation by V.Chvatal. 


— ees i EES ae 


We first demonstrate closely matching upper and lower bounds 
for the worst-case complexity of quadratic monotone Boolean functions. 
Definition: Suppose neNN . Define 
QC(n) = max{MC(f)|f is a quadratic m.b.f. in ‘Bits 


Qc (n) = max{MC (f)|f is a quadratic m.b.f. in Bits 


and QC(n) = max{MC\(f)|f 4s a quadratic m.b.f. in BL}. 
We similarly define QL(n), QL .(n); and QLy(n). 


We can use standard counting techniques as found in- numerous’ 


Proofs [Shannon 1949, Lupunov 1962, Fischer 1974] to establish lower bounds 
on QC. 


3.1 Theorem: Suppose «> 0. Then most quadratic m.b.f.'s f in By 


have 
Cp (f) = (1 - ©)n*/(4 log n) 
2 


for n_ sufficiently large, where Bo is the complete basis of all 


two -input Boolean functions. 


Proof: Let Q, £8, denote the set of all quadratic m.b.f.'s 
of n variables. Since there are 8) subsets of the 
Variables {x).X,....x,} with two members, Qt = a2) - 1. 


Suppose eQ.- We know that there is some Bo-circuit N which 
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computes f since Bo is a complete basis. If N contains two 
~ gates which compute the same function, then as in Chapter 2, one of — 
these may be eliminated to yield a yeah with fewer gates which 
also computes f. Also, if f is not. a constant function: it is pos- 
sible to construct a Bo-circuit: for. fin which each constant node 
has outdegree zero. We thus consider only B,-circuits in which each 
gate computes a distinct function, and in which: constant nodes’ have ~ 
outdegree zero; we call such circuits: reduced... 

Now suppose qeWN and ‘Nis a reduced Bo-circuit: of size <q 
which computes a function feQ,. By possibly adding additional ‘tumty 7 
gates, we may assume that there is a reduced Byrcireutt of size eXx-. 
actly q_ in which one of the gates computes f. Let Ra denote the 
number of quadratic m.b.f.'s computed by such reduced circuits of 
size q. Since each gate has two inputs, and can be labelled with any 
of the 16 binary Boolean functions, there are at most 16(q + nj | 
possible assignments of function- a neue pair combinations for each. 
gate in a reduced Bp-circuit, giving a total of at most [16(q + n) aya. 
such assignments of reduced B,-circuits. However, many of these la- 
bellings describe the same circuit -- in fact, there are q! differ- 
ent enumerations of the computational nodes of..any fixed: reduced circuit. 
Hence, there ave at most 16q + n)?%a! reduced circuits of size . 

q. Since the quadratig output function may be computed at any gate of 


the circuit, we have 


5 7= 
Ry a+ 169 (as n)*4qt 


< (169(q + n)24*T 409) 7( /onq- q4) (using Stirling's 
approximee ion) 


< ¥(2/7)(64e)"- ao (1/2) (when q =n) 
< (cq)9, ( for some c20 and q_ suf- 


ficiently large ) 
Now, suppose that qs (1-e)(5) / log (D)- Then 
yan n 
Ry < Ce (I-e)(5) / tog (5)] (-eMCg) / Tog (2) 
n n ; 
< (5) (1-e)(5) / log (9) ~  ( when log (5) = c(1-<) ) 
< 2(l-e)(5) 
Hence the fraction of all quadratic m.b.f.'s counted in Ra is 
n of n 
R/2 (2) lt-e) 2) 2 (9) = 27° Q) 


which tends to 0 as n+. Thus most quadratic functions have Bo 
complexity at least ee / log 6) ~ “(e)(n? (4 Tog n). 
( Theorem 3.1 


3.2 Corollary: QC(n) = a(n? /log n). 


Proof: For any quadratic m.b.f. f, MC(f) 2 Cp (f) since 
. ot D 
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Mc Boe Oo Corollary 3.2 


We can prove that alt quadratic. mb.f.'s can be computed using a 
monotone circuit which is no more than a small constant factor larger 
than this bound. The idea of the proof is similar to that used in 
the Four Russian's algorithm for transitive closure ‘[Arlazarov et al. 
1970], and has been used previously for Eonstmicetne effictent cir- 


cuits for other classes of functions [Tarjan 1976]. 


3.3 Theorem: Suppose eB, ts a monotone quadratic Boolean function. 
Then 


MC(f) < 4 n@/log n + O(n). 


Proof: Let m= log ny. We. Partition me Anput variables of 
f into n/m! subsets, the (jth subset Se dbnsisting of the set 
S, = {x| (i-1)m <jsi-m, j sn} for ie{lin/m]. Each subset, with: 
the possible exception: of the last, has’ m variables: in it. If feQ, 


is given, for each input variable x, and each subset Sj we define 
Aa, = {xpeS,|k 2 i and xyxpePI(f)}. 


Note that Ay; =9 for j<li/ml Let Ay; be the Boolean function 
of the disjunction of all variables ff Aaj (recall that by- conven- 
tion Ai is the constant function 0 when Ay = 9). We clearly 
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can express f as the factorization 


a Ca/al 
i aes ae ee « ayia)? i 


Qur goal will be to compute the functions Aig efficiently, and 
use the decomposition (1) to compute f. We make use of the following 
result [Tarjan 1976] which allows efficient computation of sets of 


disjunctions. 


3.4 Lemma: Suppose X is a set of m variables, and Fy ofos-oesfy 
are monotone Boolean functions which are the disjunction of subsets 

of variables of X. Then there exists a circuit N consisting solely 
of v-gates which computes F = {f) fos.--sf} such that 

Coyy(N) < 4km/log k . 


Proof of lemma: Let r={log k. We decompose the set of X var- 
jables into [Fm/rl subsets Tyolgee ela each containing no more 


than r variables. For each i, the circuit N initially computes 


the disjunction of all possible subsets of T;- This can be done ina 
manner which has one disjunction for each subset of size 22 by com- 
puting the disjunction of smaller subsets first, and then constructing 
the disjunction of a larger set with one additional variable by using 
a single v-gate to add this variable to the previously constructed 
disjunction of the smaller set. Hence, this can be performed using at 


most 2°-r-1 disjunctions for each ic[1-m/Yi], giving a total of 
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(2° - r= 1)(m/r}) < Km/r1 disjunctions . Since each function 
a oe 3 * 
these previously computed funct#ons {one per subset T,), we can com- 


can be constructed by computing the di sjunction of Tm/rl. of 
pute F with at most an additional k(m/r1 - 1) v-gates. Since 
Tin/r ds 2n/ 10g k,! the total number of " vegates used in this constryc- 
tion is at most 2K mW/rl- k < 4km/log k. | sat, hi 
— | Lemma 3.4 

Returning to the proof of Theorem 3.3, suppose that j “is 
fixed. Note that Ay, #@ -for at-most j-m values of i, namely 
for each ie[]:j-m]. Hence the computation of the functions Ai; 
for all jem values of i can be accomplished using the method of © 
Lemma 3.4 with at most -43m°/10g (jm) “vegates. Thus, one can compute 
all functions Aa for ie[lin], je[1:"n/m] usiagiat- most — 

Tn/m) 5 shee 6250 Paar eae 

ee (4jm")/(Tog(jm)) = 4m°{ ie J/los{ jm) ) 


= 4ni[n®/(2a° Jog n) + O(n/ log” n)j 


2 2n@/1og n + Ofn) a v-gates. 


*To sum the series, note that 


Ca/n : n) 
7 §/log (im) < J!" x10g(xm)) dx ean 


s (1/m@) ip x/log x dx + 2n/(m. log n) 


(cont. ‘on next page) 
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Finally, combining these disjunctions ‘to compute f using (1) 
can be done using an additional n-~ 1 a-gates (since Aj = 0 


for all j); the number of additional v--gates is at most 


; a 
LOO = AT) + 0-1 < (inet bm #1) tn-1 


cy y 
Seb + 2n-1 
= ) Jj/m + 2n 


Again, since n@/m < 2n*/log n, we have a total of 4n¢/log n + O(n) 


gates in the M-circuit constructed . re 

er a O Theorem 3.3 
Section B. Asymptotic Bounds on a-gates 

In the remainder of this chapter, we consider the measures QC, 
and QL, which are the minimal number of a-gates required in any 
But 


n bea . 
Uh x/log x dx < Y i/log i Yat [i/log i + (n-it2)/ log(n-it2)] 
oe i=2 i 
Ln /2J+] 
2° 2. she EU rate +1) 


2(Ln/2rn/21))/(2 log (-H/2441)) 


tt 


n@/2 log n + 0(n/log n). 
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circuit or formula respectively .to express a-quadratic m.b.f. of 

n variabtes. We do not consider the measures QC and QL, and 
consider determination of their values :to:be an-interesting open’ | 
question. When arbitrary M-circuits are allowed, the bounds we obtain 
on A~comptexi ty. are not as. tight as those’ ‘in ‘the previous section, 

but tighter bounds are given in Section C when 

A-gates are not allowed ‘to: have a path to another a-gate in the 


circuit. 


3.5 Lemma: Suppose n22. Then QC (n) s:QL, (n)..s n-1.- 


Proof: Obviously QC, (n) s Qt (ny “since a formula may be 


gonsidered a 1 special kind of a circuit. 


We prove that QL (n) < n-1 by fnduttfon on on. 


If ne 2, then clearly QL, (2) 1 since ‘the only quatvants 
m.b.f. of 2 variables is Bog) = X1 A Xpe . 

Now assume inductively that QL, (n) < n-1, and suppose that 
f is a quadratic m.b.f. of ntl. variables. _,We can express ; f 


‘as the factorization 


F(Xp eee Xa) * (x Katy * (Xp o+- XQ) v AUX] oe + XQ) 


where g = Vix, | x,-x,,,¢ePI(f)} and 
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he Vis; | x4 xjePI(F), 1 # nel, j # nt1} 


Clearly ML (g) = 0 since it is simply a disjunction of single 
variables. Also ML (h) < QL(n) since h is a quadratic m.b.f. of 
n variables. Inductively, we then know that ML (h) <n-l, so 
ML (f) < ML (9) + ML (h) +] <n. 

O Lemma 3.5 


Remark: One can show that QL,(4) = 2 by examining all quadratic 
m.b.f.'s of 4 variables, and hence a similar proof demonstrates 


that QL.(n) <n-2 for n2 4. 


We would now like to determine whether there are quadratic | 
m.b.f.'s which require a number of a-gates which is close to the 
upper bound provided in Lemma 3.5.° It is easy to demonstrate a 
quadratic m.b.f. in B, for which (a(n) a-gates are required. 
For example, suppose n is even and consider the function 


Fy (xq 2++ + 9X,) = (x 4X5) v gg) Ne Vv (xq 1% q)3 the proof that 


T We remark that the number of prime implicants of a quadratic m.b.f. 
correlates poorly with its a~complexity since (as we show in Chapter 
5) the function Tp» which has () prime implicants, only requires 


Tiog n! a-gates to compute. 
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fg requires n/2 a-gates to compute is relatively straightforward. 
It is an open question whether Qc (n) is asymptotic with n. 
Our objective is to show that the constant factor of the trivial 


lower bound of n/2 may be improved to 2/3. 


3.6. Theorem: Suppose n= 2. There is a quadratic m.b.f. eB, 


such that MC (f) = |2n/3 . 


Proof: We define f as follows: partition the n variables 


into k= ‘n/3! subsets Sy So a0-098 » each containing 3 variables 


kK 
(except possibly S,). We consider the function 


eo T3(S) et Yee, T3(S)) (2) 


2 


where c (S;) is the threshold 2 function of the variables in S. 
(for the set Sp if n is not evenly divisible by 3, we consider there 
to be 1 or 2 "dummy" variables Set to the constant 0). We 
claim that MC (Ff) = pn/3;, and prove this result by induction on 
k in the event n = Q(mod 3). The other cases n = 1({mod 3) and 
n = 2(mod 3) are proven similarly. We first show that 
MC (f) > 2n/3 . 
Suppose k=1 and n= 3. Then F(x, »Xo 0X3) = Teter le 
By Theorem 5.9 in which it is shown using other means that 


MC (15) = Mog il, we have MC (f) = log, 3 = 2. 


Now assume the statement true for k and n = 3k -- namely 


that the function 
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f(x; s+e00X,) = TAS) ve v TE(S,) 
en as i 2‘~k 
has Mc, (f) 2 2k. We are concerned with the function 
, ri = 2 ? : 3 oan 
-f (Xy 9+ ++ 9X) s¥q Vo V3) — F(x, ++ +9X,) v T3(y, »¥o0¥3) 


where we have re-named the variables for ease of description. 
Suppose N is an M-circuit computing f*. The general 

aim of the proof will be to show that there are constants ay 85 

and a, in {0,1} with the following property: if y. is 

set to a, (for i x 1,2, and 3), then the resulting civeutt N- 

computes f; moreover, it is possible to eliminate two unnecessary 

A-gates from N* since each gate has at Teast ane predecessor 


which computes the constant function 1. We proceed in a fashion 


similar to Paterson [1976}. 


Definition: Suppose. N is a’circuit which computes f*, and 
. suppose i,je{1,2,3}. We say a node G of N_ has property Pig 
iff Yi 5 is a prime implicant of Res(G,N); 


Definition: If P is any property on nodes of N, we define the 
initial occurrences of P, denoted I(P), to be 
I(P) = {GeNodes(N) | G satisfies P, but no predecessor of & 


satisfies P }. 
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3.7 Lemma: Suppose N is an-M-circuit which. computes f-, and 
{,je{] 2253} 3 | ? ie Then 
(1) (Py 3) #8 


and (2) I(P, ) consists solely of s-gates. 


Proof: Since the input variables do. nut catisty fg and the 
output gate of N does, by passing up the circuit | N from the 

output gate we must eventually find a gate which satisfies P. 7 

but none of whose predecessors do. This ‘establishes statement 

(1). For statement (2), suppose that S is an v-gate which sat-— 
isfies Pi i? and let Pred(G, Ny = 0, kK}. by the remarks in 

Chapter 2 on prime implicants, we know ‘that PUG, N) s g PI(I.NUPI(KN), 
and since y,¥ePL(GAN), one of J or K must satisfy Py. 

. Hence GfI(P, 5). 


0: Lemma 3.7 
3.8 Lemma: Suppose N,i, and. j are ‘as above, and© Gel (P, Rie If 


Pred(G) = {J,K}, then either y,ePI(, N) ‘and ysePI(k, N), or 


vice versa. 


Proof: By Lemma 3. 7, if Gel (P, J)? then G is an a-gate, 
Every prime impTicant of G is a product. of a wexber of PI(d) and 
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a member of PI(K). Since neither PI(J) nor PI(K) contains 
Yy°¥j> we must have either y;ePI(J) and ¥yePI(K), .or vice versa. 
| | O tewma 3.8. 


3.9 Lemma: If N is as above, then | I(Py o)ul (Py g)ul(P, 3) = 2. 


Proof: Suppose to the contrary that there is only a single gate 
G in 1(Py p)uE (Pp 3)ul (Py 4)s. and suppose Pred(G,N) = {J,K} . 
By Lemna 3.8, we may suppose w.1.0.g.. that: ¥<PI(3,N) and 
YoePI(K,N) since GeI(P) 5). Again by virtue of Lemma 3.8 and 
the fact that GeI(P, 3), we must have y,ePI(J,N) since 
YoePI(K,N) implies that y,ePI(G,N), and hence that YjYoéPI(GN), 
contrary to the assumption that Gel(P) 9). But then G¢I(P, 3) 


since 193 Cannot be a prime impifcant of 
Res(G,N) = (yyy hh (yh). This contradiction implies that the 
union contains at least 2 gates. 


O Lemma 3.9. 


We are now ina position to prove the main theorem. Since 
there are at least two distinct a-gates in I(P) p)ul(P, 3)ul(P, 3) 
and since none of these sets is empty by Lemma 3.7, we may W.1.0.g. i 
assume that there exist distinct gates G and H in N such that 


GeI(P, 5) and Hel(P, ,). Assume using Lerma 3.8 that 
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Pred(G,N).= {Ky »K5} and ¥pePL(Ky oN)3 -and similarly that 

Pred(HsN) = (KgsK,} and yoePI(KysN). (It may be ‘possible that 

some of the .K hades: are not distinct.) The assignment of the 
variables y, = 0, Yo * 1, and y3* 0 results in'a circuit N° 
which computes the function f. Moreover; in N° -the inputs — Ky 
and K3 to G and H respectively now compute the constant function 
1, and hence gates G and H. may be eliminated ‘in N* sthce 
Lageg . for any Boolean function g.-' Hence if the resulting 


circuit is called N°”, we have . 
MC CN) = MC, (N°*) +22 Mc (f) + 2. = 2n+ 2 


by induction, and.the lower bound ‘is complete. 
.To show that C tf) < (2/3)m , tt suffices: to observe that: 


T(x, »Xo 2X3) - ((x) vx5) 4x3) NM "(a 4%) Wo ; 
and hence ¢, (79) < 2. Thus using the definition (2) one can 


compute f using (2/3)m a-gates. 
| 0 Theorem 3.6 


3.10 Corollary: Suppose n 2 4. Then 


12/3)y. < Qc.(n) < Q@ (n) sn-2. 
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Section C. A Graph Problem 


~ We close this chapter by describing a graph-theoretic problem 
which is related to the »-complexity of quadratic m.b.f.'s when 
they are computed with a restricted class of M-circuits. The 
research in the remainder of this chapter was done in conjunction 
with Ronald Rivest. We introduce notatfon similar to Harary [1969] 


for undirected graphs. 


Definition: An undirected graph G is a pair (V,E), where V is 


a finite set whose members are called nodes or vertices, and 
Ec{BcV | [B] = 2} is ; set whose members are called edges. _ 
‘A bipartite graph is a graph G = (V,E) whose vertices can 
be partitioned into two non-empty sets V) and Vo such that if 
edge {Vy sVoheE, then vieVy * VoeV5 5 that is, each edge dansists of 
one point in V, and one point in Vo. LSE ee - 
A complete k-% bipartite graph is a bipartite graph 
(V= VjUVysE) such that |W =k [Vol =. and 
E = {{¥) Vo} | vyeVys voeVo} . 


Definition: If G= € (Vy Eq) s{VnsEo) +++ 0(Vy Ep} is a collection 
of undirected graphs, and G = (V,E) is an undirected graph, then 
an exact covering C of G by G is a collection of one-to-one 


mappings h.:¥; + V_ for fe[1:k] such that E = se expen) lecE «} 


-279- 


= 
Fe 


where h({a,b}) = {h(a),h(b)} for any edge fasbheE,. The’ cost of 


cover,. denoted B(c), is k. 


not bipartite bipartite complete 


:' 3-3 bipartite - 


Fig: 3.1 -Some Graphs — | 
_ Points represent nodes and lines: represent edges. 


We can now define a measure on graphs. 


Definition: Suppose neN , and G is an n-node undirected ; 


graph. -Then 


- D(&) = min{B(c) | C is an exact cover of G by {G)5.--sG)}5 
where G; isva complete bipartite graph. 
— for ief{1:k]} . 


Define H({n) = max{D(G) | G is an n-node undirected graph}. 
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The connection between quadratic m.b.f.'s and graphs is as 
follows: Suppose f is an n-variable Boolean function. We can 
associate a corresponding n-node undirected graph Ge = (V,E), . 
where V = {1,2,...,n} and E = {{i,j} | X4X5ePI(f)} 7 Conversely, 
suppose G= (V,E) is an n-node undirected graph. By possibly 
renaming the vertices, we may assume V = [I:n]. Define the Boolean 


function fg:{0,1}" + {0,1} by 


F(x, 0+ +X.) = VAxs SX, | {i sJ}eE} - 


The functions f + Ge and G+ f, Clearly demonstrate a 
one-to-one correspondance between the set of quadratic m.b.f.'s in B, 
and the set of n-node undirected graphs with at least one edge. 


See Figure 3.2 for some examples, 


2 
| 2 5 8 
3 of 4 67 Tax 
ps T(x, Xp 2X—) V T3(x Xr 9X) V T3(x XaeXa) 
2° aaa a Aaa 244 e5P'6 2°°7°"8°"9 


Fig. 3.2 Some Functions and their Graphs 
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In fact, we can get a more strict correspondence in mathematical terms. 


Notation: Q = (f | f is a quadratic mb.f. of n variables}: u- 
{0, the constant function 0 of n variables}. 
G, = {6 = (V.E) | G. isan undirected graphs and V = [1:n]}. 
Z, is the graph ([1:n],9)° in Ge: | 
If G) = (V,E)) and G, = (V.E,)e@., define GuG, to 
be the graph (V,E,uE,). 
If Ais any set, and f:A +B, ‘then f(A) = {f(a) | acA}. 
If G=(V,E) fs an undirected graph, and h:¥ +4, 
then h(G) is the graph’ (h(¥)h{E)).” 


3.1] Lemma: Suppose n 20. Then 


(1) (Q, +0 ) and (G, susZ,) are monoids, 


and (2) the function f + G_ is a monoid isomorphism whose 


inverse is the map G + fg. 


Proof: Statement (1) is easily checked, as are the facts that 


the maps in (2) are ofie-to-one, onto, and inverses of each other. 


Moreover, if f, and FQ, then 


PI(f,vf,) = PI(f,) U PI(f,) 
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since each prime implicant is the product of two variables. Thus, 


if 
a = ({1:n],€), as, = ([1:n],£,), and SF = (Li:n],E,), it 


is easy to check that £ = EyuE,; and the claim is veri fied. 


O Lemma 3.11 


3.12 Corollary: Suppose C = (hy >++- shy) is a covering of the 
graph G by graphs G, Gp... 6). Then 

f.=f v f v wee vf, i 
G h, (6, ) ho (G5) h, (G,). 
Proof: Proves inductively using Lenma 3.11. 


0 Corollary 3.12 


We now connect the complexity of functions and their graphs. 
We will say an M-circuit or. M-formula Nis single-level if 
there is no path between any pair of a-gates in N. | Obviously, any 
quadratic m.b.f. can be computed by a single-level .M-formula and 
hence a single-level M-circuit. In a single-level M-circuit Ns 
each immediate predecessor of every A-gate consists of the 
disjunction of some set of input and constant nodes. Moreover, if 


N computes a quadratic m.o.f. f, then there is some subset a of 
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the a-gates of N= such that f = ;y Res(AsN). ~ Thus, given a 
single-level M-circuit N computing a quadratic m.b.f. f, one may 
construct a single-level M-formula F which also computes f and: 
which has the same number of A-gates ast N since computing any: 
number of dis junctions of variables takes no angates 


If f isa quadratic m.b. f: “define 


SLC, (f) = nint4c, (N) N isa single-level M-circuit which computes f}. 
We shall prove 


3.13 Theorem: Suppose f is a ue’ m. b. oie Then SLC, (f) = 
D(G 3 : : 


5): ae 
Proof: Suppose f is a quadratic m.b.f., and that 

D(Ge). = k. Let C= (hy>--- shy) be an exact cover of Ge by 

complete bipartite graphs Gy oee-sG,. We will construct a single- 

Tevel M- formula for f which. has one. A-gate for: each.graph in 

the cover of Gp. Suppose G, = (Va.= MyquYapeEq):- is the decompo- . 

sition of the vertices of OG, as in. the. definition of ; bipartite 

graphs. 


Note that h, (6, ) is also a ‘complete bipartite sraph. Since each 
graph is complete, 
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> peed : 
Fra, ) = VIX 5Xp | {J k}eh, (E,)). 
"(VES | dehy (Viy)I) 0 (Exy | keh, (V45)3)- 
Hence, using Corollary 3.12, we can express f as 
= = Vv 
k : 3 
= My [Mexg | dehy (Vp )9) * (Voy | keh (V,0)4)1 


which is a formula with k A-gates. Thus SLC,(f) <k. 


Conversely, suppose f is a quadratic m.b.f., and SLC (f) = k. 
By the remarks preceding the theorem, there is a-single-level 
M-formula F for f such that MC (F) = k. Since F is single- 
level, we may write 


k 
Fev TV x) a (¥ x) 
i=] eA, J keB, k 


for some subsets A, and By of [I:n] for'each jief[1:k]. Since 
PI(f) has no single variable terms, we know that A.nB, = 9 for 
each jie[1:k]. Thus 


.k 
f(x) = Xk | JA, and keB;}, 
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So if we define G. to be the. complete’ bipartite graph with vertex 
set A,uB, (decomposed as A, and .B,) for each ic[1:k], we 
Observe that C = (id,id,...,1d) is an exact cover of G, by 
complete bipartite graphs G, G55---sG,, where id is the identity 
map. Hence D(Ge) < k. . | : 
O. ‘Theorem 3.13. 

If we define SLQC, (n) to be the maximum value of — “SLC(f) 
for qny quadratic m,b. f. of n variables; ‘then cTheones 5. 13, 
implies that SLQC (n) = H(n). Since any.singla-level-M+formila ie, 
is in fact an M-formula, we know QL (n) < SLQC (n). We note 
that the proof given for Lemna 3.5 actually yields a ‘single-level 
M-formula for any quadratic m.b.f., so 'SEQC,(n)‘s 7-1. Since 
QL (n), = e(n), we know also that SLQC {n) ©e(n). For the latter 
function, it was pointed out to the author by V. Chvatal™ that one 
can actually get asymptotically matching upper and lower bounds on 
H(n). bt aes a 


3.14 Theorem: SLQC,(n).= H(n), and H{h) ~ nn. 


We include a sketch of the proof of Theorem 3.14 in Appendix 1 


tprivate communication, 1977. Recent work by Bermond [1978] has 
tightened the bound given on H(n). 


By 0 
section D. Open Questions 


At the end of each chapter, we list some open questions pertaining 


to that chapter. 


1. Demonstrate a m.b.f. for which ML (f) + ML (f) < ML(f) - 1. 
2. Is QL(n) = a(n@/log n)? More generally, do o(n*) 
monotone formulas exists for all quadratic m.b.f.'s? QL(n) is 


obviously O(n2). 


3. What is the asymptotic behavior of QLy(n) and QCy(n)? 


Determine QL, and Qc, more exactly than given in Corcilary 3.10. 


4. If f is a quadratic m.b.f., is SLC (f) = ML (f)? Is 


A 
ML (f) = MC Cf)? 


5. More generally, is SLQC (n) = QL. (n)? 
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CHAPTER 4 
Combinations of Functions 


One fundamental problem of Boolean function complexity is deter- 
mining the relationship between the complexity of individual func- 
tions and the complexity of combinations of those functions. The 
relationship depends not only on the type of combination but also 
on the model and measure of complexity used. In this chapter, we 
explore several questions in this area. 

Given Boolean functions f. and g, we consider primarily the. 
combinations fxg and v(fxg). Results about the function a(fxg) 
may be obtained by duality from those for ulfea ys. We do not consider 
the composition of functions; this area has many interesting open ques- 
é tions. The measures which we consider are formula and circuit size 
over different bases. 

As Paul [1976 p.383] points out, one might expect that 
Cp, (fea) = Cp, tf) + Cp, (9) since the evaluation of f and g on 
disjoint sets of variables “have nothing to do with each other." This, 
however, is incorrect; Paul shows that for any e-> 0, there are ar- 
bitrarily complex functions f in Been such that 


Cp (fxf) < (1 + e)Cp (f). In addition, he exhibits. arbitrarily com- 
2 2 


plex g in B such that C, (v(gxg)) < (1 + e)Cy (g). 
2 2 


For M-circuits computing monotone functions, the results are 


drastically different. | M. Fischer proved that 
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MC(f»g) = MC(f) + MC(g) for any mbofc's FeB., os and geB, 
[Paul 1976]. It is sti11 unknown whether Hoth) + MC(f)4MC(g)+1. 
In Appendix 2, we present an peels ioh to Fischer’ s result, due to 
Galbiati and Fischer sche sah a sets.of variables on which f and 
g depend have one. variable. in. common. ole addition; we show that - 
Fischer's result holds. when v-gates: and. A-gates ‘are counted separ- 
ately; namely that. MC (fxg) = MC Af) + MC (gy). and duatly: 
MC, (fxg) = MC\(F). + MC (g).. 

We consider the question of formiTa size additivitys the only 
combination considered is v(fxg).. “Here, in contrast to Paul’ s 


results on circuit size, we show that formula : size : additive under 


v, namely 
La(vifg)) = Ag(f +L) 


for any basis © which contains v, and any non- ~constant Boolean 
functions feB, and gecB.. | - When counting individual types of gates 
in formulas over the monotone basis cs we show that 

ML Jtv( Fxg) = ML vf) + ML yg). +1 for avoterary fonotone functions 


f tn BL and g in 8, ; the corresponding question, 


ML (v( fxg) £ ML alf) + ML (a) remains open, even for simple classes of 


+In fact, Fischer shows that any witfimal M-circuft for fxg is 
composed of disjoint minimal circuits for f and g. 
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functions such as quadratic functions. oe | 
Finally, using fact (1), we demonstrate a gap between the. circuit 
and formula complexity of some specific functions. In particular, we 
show that. there is a sequence of functions whose Bo-circuit comp] ex- 
ity is O(n log? n), but whose B,-formula complexity is 
a(n®/log log n). A slightly larger gap between these measures for a 
particular function had been pvavinusiy shown using different techniques 
by Paul [1977]", who observed the implications of fact (1). 
The following chart summarizes the duerent state of what is known 


about these problems'’. 


tworst-case results for these measures show that most functions | 


eB, have Lp (f) = A2"/10g n) and Cp (f) = 0(2"/n). Hence for 
2 2 


most functions the ratio between these two measures is (n/log n), 


but the measures themselves are exponential in nv. 


TT paul's result on the sub-additivity of x in the measure Ce used 
2 


multi-output functions. 


-82- 


Combination 


Table 4.1. Summary of results. 


Key - ‘ADD: combination of minimal circuits/formul as 
‘gives optimal circuit/formila for combination 
SUB: fegation of ADD for some particular functions 
2: open : 


_ =: does not apply 


Section A. The Formula Size of v(fxg) 


In this section we show that one cannot economize when constructing 


a formula for v(fxg) in an arbitrary binary basis. 


4.1 Theorem: Suppose n and mew» FB, and 98, are non- 


constant Boolean functions, and @ is an arbitrary Boolean basis. 


Then 


Lo(v(fxg)) = Lo(f) + L(g). 


Proof: Suppose F(Xys0 0X s¥z ores Ven) is an 2-formula for 
V( Fxg) (Xp 206 6 9Xy Vy oe + + Vn) = F(Xy 9-06 9X, )VO(Y] 9+ + + Ng) Since f 
and g are non-constant functions, there are constants %e{0,1}" 


and Bef0,1}" such that (2) = g(B) = 0: Thus F(X, 906+ 9X) 9b) 
even when simplified by absorbing constants: into gates, is an 
“Q-formula for f(x)vg(B) = (X), and hence has at least Lo(f) 
occurrences of variables from {X] 2000 Xp}. Similarly, since 
F(a.Yy 2+ + Ye) is a formula for g(y), F has at least L(9) 
occurrences of variables from {¥]o++ 0, Since the sets 
{XpoeeeoX,} and {¥,5+--s¥,} are disjoint, F has at least 
Lo(f) + Lo(g) total occurrences of variables. Since this is 


true for any formula F for v(fxg), the theorem is proved. 


O Theorem 4.1. 


4.2 Corollary: If a is any basis containing v, and f and g 
are as in Theorem 4.1, then Lo(vifxa)) = Lo(f) + L(g). Particular 


examples are Q = Bo and a=M, 
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To obtain the next corollary, we state without proof the following 


technical lemma due to Paul [1976]. 


Notation: For geBs and keN, k > 0, define Vg* by the inductive 


rules: 
Vg! = g 
VK « v(gx(Va""!)5, 
(so VX is the disjunction of k copies of g on disjoint sets of 
variables.) - 
4.3 Fact: (Paul) : There is a constant a. such that, forall - k 
and neN, k2 1, and‘all geB» the following holds: 


Cp, (Vat) < aemax(n-2",nk log’ k) = 


We also need the following fact due to Krichevskii [1961]. 


4.4 Fact: There is an NgeN such that for every ‘> Ng», there is a 
function eB, such that Lp (f) 2 .2"7(2- log n). 
2 eed Whe 


We now can establish the following gap between circuit and for- 


mula complexity. 
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4.5 Corollary (Paul, Bloniarz): There are constants np and a - 
such that if m > Ny is a power of 2 and n= 2" /m, then there is 
a function feB, such that 

(1) tg Cf) 2 (22) 7(2m?-10g m) 

while 
(2) Cp (f) < aeme2™ 
2 


Hence L, (f) = a(n2/(1og log n)) whereas C. (f) = O(n log2 n). 
| 2 ) . 2 


Proof: For this proof, we assume that the basis of operators 
for all formulas and circuits is B.. Let No be as in Fact 4.4. 
There is thus a function gcB., such that L(f) > 2"7(2 log m). 
By induction on Theorem 4.1, it is.easy to show that L (Vg) = k-L(g), 


so if we define 
om, 2 
f= Vge /™), 
m, 2 m 
we know that L(f) = (2"/m°)+(2"/(2 log m)), 


On .the other hand, ‘by Fact 4.3, there is a constant «a such 
that | 


C,(f) < armax(m>2™, m2". (m-2 log m)2/me 
| = geme2" : 


The final remark. follows since f has n= (2 ym?) «m “variables and 


since m< 2 logan. . O Corollary 4.5 
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Section B. Monotone Functions 


We now restrict ourselves to the class of single-output monotone 
Boolean functions and formulas over the basis M. For the combin-~ 
ation v(fxg), Corollary 4.2 proved that monotone formula complex- 
ity ML is additive since veM. “As for counting individual gates, 
clearly ML (v(fxg))s ML (f) + ML (g) for any m.b.f.'s f and g. 
It remains an open question as to whether there: are: any non-constant 
mb.f.'s f and g for which the inequality is strict, The cor- 


responding question for ML, is answered by the following ‘theorem. 


4.6 Theorem: Suppose n and mel and FeB,, and geB. are non- 


constant m.b.f.' S. Then 


Me (v( Fg) Mf) + ML (g) #1 


Proof: The fact that ML (v(fxg)) < ML (f) + ML (9) +1 is 
easily verified by combining an v-minimal M-formula for f with 
one for g by an v-gate. To show the inequality in the opposite 
direction, suppose that F(X,¥) is a M-formula for -f(X)vg(y) 
with the minimal number of v-gates. We will prove that if H is 
any a-gate of F, then H. depends’ only on variables. from the set 


X= {Xyo-++2X,} or only on variables from, Y = LY poe Veghe 


t We remark that in a minimal formula, the notions of structural 


dependence and functional dependence coincide. 
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Supposing this for the moment, we show that the theorem follows 
from this claim. Let H = {H|H is avariable or a-gate of F such that 
Succ’ (H,F) contains no a-gates}. By assumption, no node in H depends an 
variables from both X and Y. We look at Suce (HF). If 
suce* (H,F) were empty, then since F is a formula there would be a 
unique node Hy in H. But then Res (Hp sF) = v(fxg). Since v(fxg) 
depends on variables from both X and Y, and by assumption Ho 
does not, this is a contradiction. Hence Succ’ (H,F) is a none. 
empty collection of v-gates, one of which is the output gate U 


(see Figure 3.1(a)); moreover 
Res(U,F) = He Res (HF) : (2) 


We will replace Succ” (HF) with a tree of v-gates which also 
realizes the function (2) and obtain a (possibly different) formula 
for Res(U,F). Let Hy denote the set {HeH|H depends on variables 
in X}, and let Hy = {HeH|H depends on variables in Y}. By assump- 
tion, the pairs of sets Hy and Hy is a partition of H. Form 


the subformulas 


_V 
Fy Helly H 
and 
= VY 
Fy Helly’ Y 


(where the disjunction may be associated arbitrarily), and finally 
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Oia = V V 
the formula F FyvFy - Since Res(F) (ye, Res (HF) v (yen, Res (H Fs 


_ F* defines the same function v(fxg). as. F. .Moreover, since F 


was V-minimal, Succ’ (H,F) contains: exactly. |H|-1 v-gates; 


(a) (b) 


Fig. 4.) The re-arrangement 


since |H|-1 v-gates were added in the reconstruction, F° also 

contains a minimal number of v-gates. But £(Xq 900+ 9Xq)VOCY4 96+ Ve) , 

= Res (F,) (Xs¥) vRes (Fy) (X.¥)« Since Res(F,) depends only on X and 

Res(Fy) depends only on Y, it is easy to show that f = Res (Fy) and 

g = Res(Fy). Since the subtrees Fy and Fy are disjoint, there 

must be at least ML (fF) v-gates in Fy and Mm (9) v-gates in Fys 

giving a total of ML (f) + ML/(g) +1 v-gates in F*, and hence in F. 
‘To show that all A-gates in F depend on only one of X or Y, 

we suppose to the contrary that some a-gate depends on both X and Y 


variables. Let H be a top-most a~gate with this property, i.e. H 
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is an a-gate such that H depends on some variable in X and some 
variable in Y, but any A-gate H* in Pred’ (HF) depends only on 
X-variables or only on Y-variables. We will use techniques similar 
to the first portion of the proof to restructure the formula and 
reduce the number of y-gates in F. 


Suppose that Pred(H,F) = {I,J}. Define 


A(I) = {GePred’ (1,F)|G is an a-gate or an input and all. interme- 


diate gates (if any) on the path from G to H are v-gates} 


and define A(J) similarly. Note that neither A(I) nor A(J). - 

is empty by our selection of H. Since the portions of the tree 
from A(I) to I and from A(J) to J consist solely of v-gates, 
there are |A(I)|-1 v-gates in the subtree from A(I) to- I and 
{A(J)|-1 v-gates in the subtree from. A(J) to J.. Functionally, 


we have 


Res (HF) = [¥4¢)(Res(GsF))] A.[QYiy)(ReS(GF))] © (3). 


We are now in a position to use Mehlhorn and Galil's lemma 


(Lemma 2.8 ) to reduce the number of v-gates in the formula. 
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Define 
Ay = {GeA(I)]G depends on some variable in X} 
Ay = {GeA(I)|G depends on some variable in Y} 
By = {GeA(J)|G depends on some variable in X} 
and By = {GeA(J)|G depends on some variable in Y} 


Observe that since H is a top-most a-gate depending on both 
X and Y, AynAy = BynBy = 9. We construct a new formula F* 
by replacing the subformula Fu in F by a subformuia Fo 
which realizes the function 


- = V Vv Vv acl Vv \ 
fo UC gen, Res (G)) A (gen, Res ())] vE(za,Rest@)) A (gen Res (G21. 


If none of Ay: Avs Bys nor By is empty, then we define 


Fo=[(V.F 


07 ~'gea,fa) * (cep Fal vga Fa) * (gta, Fa)] (4) 


X Y 


In this case Fo. has at least one fewer v-gate since the 
[A(I)| + |A(d)] -.2 v-gates we identified in deriving equation (3) 


have been replaced by 


A(J)]- 3 


v-gates. 


If, for exampic, Ay alone is empty, then we define F, to be the 
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obvious simplification of (4), namely 
o* ‘Gea, S) * (gee, 8). (5) 


In this case, Fy contains ({Ay|-1) + ({By|-1) = [A(I)] + [By|-2 
v-gates. But By #@ since H depends on both X and Y_ vari- 
ables by definition.. Thus |By| < {A(J)| so Fy has fewer 
v-gates then Fue 

If one of the other sets is empty we proceed similarly. If 
two of the sets are empty, then they must differ in their variable sets 
since H depends on both X and Y variables. So suppose w.l.o.g. that 
Ay and By are empty. Then fo = 0, the constant function zero, so we 
may Tet Fo be the constant 0. In this case the closest v-gate in 
Succ*(#,F). may be eliminated since one input to the gate is the constant 
0;-such a gate must exist since the formula F does not compute a constant 
function. 


ie: 
Hence in all cases F has at least one fewer v-gate than F. 
Now if we can show that F~ also computes f(x) v g(y), then 


“we will have a contradiction. But this is in fact the case. Observe 
that fp implies Res(H,F); in fact, PI(f) consists of those 
monomes of PI(H,—) which consist solely of X-variables or solely 
of Y-variables. Since PI(v(fxg)) has no monomes which contain 
both variables from X and varfables from Y, by Lemma 28 we know 


that each monome in PI(H,F) which contains variables from both X 
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and Y may be eliminated from PI(H,F) without changing the function 
defined by the formula F. Since the replacement of Fy for Fu 
eliminates exactly all such monomes, F and F* define the same 
function v(fxg). But since F* has at Teast one fewer v-gate 

than F, this is a contradiction to the v-minimality of F. 


O Theorem 4.6. 


Section C. Open Questions 


(1) There are several open questions listed in Table 1. Are 
any solvable in general? If the class of functions is restricted 
(e.g. to quadratic functions) are any of the questions solvable? 


(2) More generally, for arbitrary single-output m.b.f.'s f 
and g, is | 


MC(F(g(X) 9(¥),..-.9(Z))) = MC(F) + MC(g)  ? 
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CHAPTER 5 
The Monotonic Circuit Complexity of Threshold Functions 


One important class of Boolean functions is the set of symmetric 
functions, those which are invariant under any permutation of their 
inputs. Formally, a Boolean function FeB oom is symmetric iff 
F (Xz sXp0--+ 9X) = #(K4 (4) XqGp) 922 Xa (ny) for every. permutation 
n of the set [lin]. One interesting fact about the symmetric 
Boolean functions is that they are fairly easy to. compute by a 
circuit over a complete basis. The value.ef.any symmetric function — 
on an argument depends solely on the number of inputs which are 
set to 1. Since it is possible to construct a linear-sized circuit 
which counts the number of 1's in an input and outputs that number 
written in binary,.it is possible to use the output: of such a 
“unary to binary" converter to compute the value of the symmetric 
function on the input. The best known general: schemes for 
computing symmetric functions in this way over the complete basis 
Bo require ~6n gates in all [Muller and Preparata 1975)". 

As for lower bounds on the complexity of symmetric functions, 
Lemma 2.11 shows that any non-constant symmetric Boolean function —— 
has complexity at least n-1 over 8B, since the function depends 


: We remark that using a similar idea, one also obtains a fairly small 


3.97 .66y) 


formula (size O(n for expressing any threshold function 


[Peterson 1978]. 
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on each of its inputs. Schnorr shows that all but 8 of the symmetric 
functions of n variables have B, circuit complexity of at. least 
2n-3 [1974]. Recently, stackmever showed that at least half of the 
2ntl syometric-Boolean functions of -n variables have Bo circuit 
complexity of at least 5n/2 - 5 [197%]. The latter result, along 
with an argument of Paul which establishes 4 similar’ 5n/2 - 0(1) 
lower bound on the complexity of specific. functions [1977], are the 
largest known lower bounds on the Bo cfrcuit complexity of any 
single-output Boolean function other than bounds obtained: by 
diagonal arguments [Stockmeyer 1974]. * 

In this chapter, we consider the collection of monotone ~ 
symmetric Boolean functions, the threshold functtons. - The main 
object of study will be the monotone circuit complexity of these 


functions. Recall the definition of the threshold functions. 


Definition: Suppose k,neN. Define the Boolean function threshold 
k of n variables, denoted TyeB,» by Ty(x, se+- 0X) = 1 iff at 
least k of the inputs XpoeeeeX, are 1. 


In particular, 1 is the constant function 1 for all values of 


n, and ip is the constant function O for all: m:>n. Since 


n n 
aX nA_ 
Ty (x,) = NAS and Vs = {Xx 


t Recently, Schnorr [1978b] has announced a 3n - lower bound 
on the Bo-circuit complexity of a function. 
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these extremal threshold functions have circuit complexity over M 


of n-l. By observing that 


n 2 
Ty (xy. -»X,) Pal ee fx, TX 9 » coe x, )) 


we know that Ty and T" are duals of each other, so that in 


n-k+]1 
n e n preeet = n 
general BMT) CBT k+ and MC(7,) = MC(T 14.) for 


ke[O:nt1]. 

For circuits over Bos the best known results are the following. 
The general upper bound of 6n_ for the By-circuit complexity of 
any symmetric function obviously applies to the threshold functions. 
(This general bound can be improved upon for small values of k.) 


Stockmeyer's techniques show that C, (T?) > 2n + min(k, n-k+1) - 3 
2 - 


for any ke[2:n-1]. 

In this chapter, we present new lower bounds for the monotone 
circuit complexity of the threshold functions. For arbitrary threshold 
functions, we develop a general theorem using an approach similar to 


Stockmeyer, Paul, and others which shows that 
MC(T)) > 2n + 2 min(k, n-k+1) - 0(1) 


for any ke[2:n-1]. In particular, for the "majority function" 


Ther » we show that 
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MoT) 2 3n-7. 


Using different techniques, we also study lower bounds on the 
monotone complexities for the function. >. We present proofs of 


two results due to F. F. Yaor, namely that 


MC (TD) = 2nW4 and MC (T. 2) fog ni . 


We then show that these Tower bounds are not achievable simul tan- 
eously by proving that, for n a power of 2, any monotone circuit 
for Th with Syactly fog nl v-gates must have at least 
2n+4togn-9 v-gates. 

Before proceeding to proofs of the lower bounds, we first make 
some remarks on upper bounds for. the Mecircuit complexity of the 
threshold functions. As of the present, no linear Cin k and n) 
upper bound exists on we(T™) for arbitrary threshold functions. 
Hence the gap between the lower bounds mentioned above and the best 
known circuits are disappointingly large. For Fixed values of ks. 
L. Adleman has developed a scheme for constructing. an M-circuit for 


Th which has kn + o(n) gates. i 


t . 
Private communication, 1976. 


mt Private communication, 1976. 
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In particular, he shows that 
MC(T>) = 2n + 0(yn) 


and thus that the high order term of Yao's lower bound is exact’ F 
While Adleman's method yields the best known circuits for small 
fixed values of n, a result due to A. Yao and F.F. Yao on 
building selection networks from comparators [Yao 1974] can be used 


to show that, for fixed k, 
Mc(T?) < ‘Tog k#In + o(n). 


Finally, if k is allowed to vary. with n (for example if k = h/2 );s 
the smallest known M-circuit which .computes tT has O(n log? n) 
gates. This bound is obtained by using an M-circuit of this size 

to sort the inputs in increasing order (Batcher in [Knuth 1973 pp.. 
111-114]) and then observing that Ty (xq 900 29%) = 1 iff the ae 
largest input is 1. 


T We remark that in addition, Adleman (private communication, 1976) 
has proved a lower bound of 2n + a(yn) gates for any M-circuit 
which computes Ts has exactly logn Argatess and is synenrenous 


(for definition, see [Savage 1976] ,p124). 
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In this section we show that the general lower bound techniques 
used by numerous researchers (Stockmeyer 1977, Paul 1972, Schnorr 1974, 
Khasin 1970] for circuit complexity. can yield a Aarger bound when 
the basis of allowable gates is restricted to M. The basic method 
of attack is as félTows: To prove a lower bound ‘on the combinational 
complexity of a particular Boolean function f, one attempts to 
show that there is a set of variables {x, | 1eA ¢ [1:n]} and 
corresponding constants (c; | ieA, c,e{0, 1}) such that the restricted 


function f lx. icA is a function whose complexity is 
i : me pos 


= C, for 


known to be bounded below by some quantity. If, in addition, one 
can show that in any minimal B-circuit or formula which computes 
f, the setting xy = C; 
Chapter 2) k gates because these gates now have at least. one 


for eA allows one to eliminate (as in 
constant input, we then know that 


Ca(f) 2k + Co(f hy =c, for icA )s 
hence a lower bound on CA (f) can be obtained... This is the gener- 
al approach we take; in particular, a lower bound on the resticted 


function. f |, will be known by induction. 


= Cy 
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Before describing the inductive hypotheses needed to prove our 
result, we first informally describe the method. The largest 
lower: bound obtained from this method is for the majority function 
Toa . We will reduce an M-circuit N: for an. arbitrary threshold 


function iy to one for He by.setting the input x. toa 


n 
_constant. The setting x, = O results. in-a ctrcutt for po 
while the setting x, =} results in a circuit for rae . if 
the constant can be chosen arbitrarily and«stil} allow a large 
number of gates to be eliminated, then the constant will be 
chosen so that k~ is as close to {n-1)/2 -as “possible. 


Some notation will :be useful. 


Definition: The parity function, p:n> {0,1}, is defined by 


1 if n is odd 
p(n) = 


0 if n is even , 


The partial function 2:NxN .+ N_ is defined by 


£(nyk) = [nt1-2k| whenever ksn. 


The sign function, denoted Sign:Z~+ {0,1} is defined by 


1 if x20 
Sign (x) = 
0 if x<0.. 
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The function S is defined by S(n,k): = Sign{nt+1-2k) 
for nykeN, | a 


Note that 2(n,k) is a measure of the proximity of k to | 
(n+1)/2.. Observe that p(n-2(n,k)).= 1, and:that for fixed n, 
the solution in IN to the equation. a{k yn) = #2 18 -non-existent . 
unless r<n-] and p(n-r) = 1, in which case *k ® (nt}-r)}/2 


and k = (ntl+r)/2 are the only.solutions. 


Definition: Suppose-that k,2aN ,.2%< n-3,.and p({n-2) = 1. 
Define i eo {threshold functions Tel adnik)= 2}. . 


For completeness, we-defineT) .=.8) if. > n-3 
9 


or p(n-2) = 0. 


Define MC(n,£) = max{MC(f) | fe |}. 


We observe that by the above note, if % < n-3 and n-£ is . 
a ee Nag o - 
odd, then Tied Tentl-9)/2 Tent] +g)/2 } , and the two members of 
is 2 are duals of each others. 
The prinicpal result we prove is the following: 
‘5.1 Theorem: Suppose n,teN , p(n-2) = 1, and. & sn-3 (so n 23). 
Then MC(n,£) = 3n-max(2,1) - 6 . 
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Before proving Theorem 5.1, we observe the fellowing corollaries: 
5.2 Corollary: Suppose n23. Then MCc(T" n/a ? 23n-7. 


- 
Proof: T h/2) <n p(nt1) ae - 0 Corollary 5.2 


5.3 Corollary: Suppose n23, ke[2: n-F], k # (nt1)/2. Then 
MC(T, ) = 2n + 2(min(k,n- k+1)) - 7. 


Proof: Assume ke[{2:n-1] and k eee: Since 
a(n,k) < n-3 and p(n-2(n,k)) = 1; we have Ty Thin, k) - But then, 
since k # (n+1)/2, 2£(n,k) = |[n+1-2k| = max(nt1-2k, 2k-n-1) # 0 
and thus Theorem 5.1 implies | 


Mc(T)) > 3n - 2(n,k) - 6 


3n - max(n+1-2k, 2k-n-1) - 6 


2n + 2min(k,n-k+1) - 7. 


O Corollary 5.3. 


The property of classes Ti 2 which we will exploit is -the 


following: 


5.4 Lemma: Suppose n,teN, n> 3, and 2£ < n-3. af fel, y> then 


there is a constant se{0,1} such that 
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(a) f Ika ac ee 


(b) f [iy Bos € Tht, [2-1] Paes oe 


- We claim that is  S(nsk) 


Proof: Suppose hig “The 
satisfies the properties aan if the Jemma. “This is proven by o 


examining both of the possible members of Th, ‘ For example, if 


rs ee 2)/2 € Thee? 


then 3S(n,2) = T, so 
f | = Tl T 
aol (n-1-£)/2 © ‘n-1,2+1 


The other case is similar. O Lemma 5.4. 


We will use,ane additional lemma which gives information about 
the structure of B,-circuits ( or M-circuits) computing threshold 


functions. 


5.5 Lemma [Schnorr 1974]: Let a be the basis B. or -M, and let 
N bea minimal o-circuit for ae for Nz 3and ke[2:n-1]. 
Then there exists a gate GeGates(N) and indices i,je[1:n] such 
that 
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(1) Pred(G) = (xpx,) 
(2) i#d | 


and: (3) outdeg (x5) 22, 


~ Proof Sketch (For complete proof see [Schnorr 1974} or [Stockmeyer 
1977]): Stnee N is an acyclic graph, one can clearly find a 
node G and indices i and j satisfying property (1). Since 
N is minimal, i # j. We now prove that one of x, and x; must 
have outdegree of at least 2. Suppose to the contrary that G is 
the unique gate in Succ(x, )ySuce(x,), and suppose Res(G,N) = 
G(x; »X;)- ‘Then two of the values g(0,0), (0,1), and g(1,1) 


must be equal since they are all members of {0,1} . 


t Actually, Lemma 5.5 holds for any function eB, with. the property 


that, for any pair of distinct inputs x, and x 


j j’ 


fy, zc, | Co {0,1} 3] 2 3. 


1 


Xi ™ €o 


Such a function is said to satisfy the "2-3 property". Lower bounds 
on the complexity of functions with this and more generalized 


properties have been previously studied [Savage 1976]. 
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Consequently two of the functions © T i lx, ie. Ix, : 
, =O... x,=1 
X, : “ 


and Tr I,. = 7 “must be equal. « Since these -thtee functions:-are 
i 


=] 
*j 
distinct, this is a conbrantctiony, so one of Xj or ee must have 


outdegree at least 2. . “ D Lemma 5.5. 
We can now return to the proof of Theorem 5.1. 


Proof of Theorem 5.1: The proof is broken down into two 


stages: ; 
(i) the result is first proven for 2 = n-3 for all n 
and 


(ii) the result is proven for 2s n-5 by induction on n, using 
as basis part (i). 


Part (4): Suppose 2% = n-3. In this case, T, = «1,7 ae eae 


Lower bounds of 2n-3 on 3. and saith lin these 
functions have been proven previously by Schnorr [1974]. This 
proof is repeated in [Stockmeyer 197%]. In fact, the lower bound 
of 2n-3 on MC(T>) follows from. Theorems 5.6.and 5.9, which 

will be proven later in this chapter independently of these results. 


The lower bound on MC(T? 4) follows by duality. 
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Part (ii): Suppose that 2 < n-5, feT,, a? and N. is a minimal 
M-circuit which computes f. We break the proof up into several 


cases. 


Case 1: There is an input Xs with outdeg (x.) 2 3. See Figure 


5.1. 


Fig. 5.1 Case 1 


W.1.0.g., by renumbering the input nodes of N we assume that 
i=". In this case, by Lemma 5.4, there is a constant se{0,1} 
such that f =f re gage Th-1, [2-1] . By-setting x, to 
as in N, all gates in Succ(x, NN) may be eliminated, so the 
resulting contracted network N’ has at least 3 fewer gates than 


N. Hence MC(f) = Cy(N) 2 Cy(N*) + 3 


> MC(f*) + 3. 


If 221, then feT 1? 50 by induction 


n-1,2= 
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MC(f*) + 3 = 3(n-1) - max (£-1,1) - 6 + 3 


3n - max(2-1,1) - 6 


IV 


3n - max(2,1) - 6. 


Qn the other hand, 
if R= 0, then Fret yy 7° so 


MC(f~) + 3 = 3(n-1) - max(1,1) - 6 + 3 


= 3n-7 = 3n - max(2,1) ~ 6 


This proves the result if Case 1 holds. Assume in the 
remainder that Case 1 does not apply. Assume in addition that 
Xp oXqs and G are as in Lemma 5.5; w.1.0.g., we assume that j = 
Since Case | does not apply, outdeg (x, ) = 2, so we may let 


{G,ti}. 


Succ(x, ) 


Case 2: G is an a-gate, and H is an v-gate. See Figure 5.2. 
In this case neither G nor H is the output gate since 


Res (H,N) Ly =] is constant while f i 54 is not; similarly 
n n 


eg? t ie = 9: Hence we can assume that A and B 
n 


n. 
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are arbitrary members of Succ(G) and Succ(H) respectively. 


(Note that we are not assuming that A #8B.). See Figure 5.3. 


Fig. 5.3 Two Gates 
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We claim that A #H; that is, HéSucc(G). If it were 


(see Figure 5.4) then Res(H,N) = X V(X AX.) = xX, and the circuit 


Fig. 5.4 An Impossible Situation 
N is not minimal. Since Pred(G) = (x4 9x} » GéSucc(H) and thus 
the gate(s) A and B are distinct from G and H. (In fact A#B 
but we do not need this). 


By Lemma 5.4, there is some constant s such that the restriction 


! = 
xy or 


of G or H_ now computes the constant function —s, and hence at 


fo ef 4 ees jee" By setting x, to —s in N, one 


least one input to either A or B .is now constant. Hence the 
resulting network N~* , which computes f*, can have at least 3 gates 
(G, H, and one of A or 8B) eliminated from it. As in Case 1, this 


implies that MC(f) = 3n - max(2,1) - 6. 


Case 3: G@ is an v-gate, and H is an a-gate. 


This case is handled similarly to Case 2. 
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Fig. 5.5 Case 4 


In this case, the objective will be to show that by setting 
xy to the constant 0, at least 4 gates can be eliminated from N. 
Observe that by Lemma 54, f koe 0 € Tn-1,041-2 Ta, [2-1] 

As in Case 2, it is clear that neither G nor -H is the 


output gate since f ye = 0 is not a constant function. Hence 
n 


outdeg (G) => 1 and outdeg (H) => 1. Also HéSucc(G) since if 
this were so then Res(H,N) = KA A(X Ax, ) = X) AX, (see Figure 5.6). 
But this means that two gates, G and H, compute the same function 
x AXys 4 situation that never happens in a minimal circuit. Ina 
similar fashion, it is possible to show that GéSucc(H). 

We now show that there are at least 2 gates in Succ(G) u Succ(H). 


Suppose to the contrary that there is only 1 gate A in 


Fig. 5.6 Another Impossible Situation 


Succ(G) u Succ(H). See Figure 5.7. Let Pred(H) = {x J}, and 
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suppose that A_ is an op-gate, where ope{a,v}. By using the . 


identity (for any m.b.f. f) 


(x,A%;) op (x,af) = x, A (x, op f) 


we may obtain a smaller circuit by replacing the subcircuit in 


Figure 5.7 by the following one with one fewer gate (Figure 5.8); 


Fig. 5.8 The Rearrangement 


that is, eliminate gates G,H, and A; add gates B and € as 
above and replace A by @G in Pred(K) for every gate K in 
Succ(A,N). Thus, if N were minimal, there are at least two gates 
in Succ(G) u Succ(H) (see Figure 5.9). 

We now are in a position to show that at least 4 gates may Ba 
eliminated by setting x, = 0. Let N* be the circuit obtained 
by this evaluation, and note that Res(G,N-) = Res(H,N~) = 0. 

Hence all gates in Succ(G) uv Succ(H) have at least one input 
constant. Since this is at least.2 gates in addition to G and 


H, this means that at least 4 gates in N° have a constant input 
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and may be eliminated, yielding a reduced circuit -N~~. 


Fig. 5.9 “Two More Gates 


Finally, we show that this elimination of 4 gates yields the 
desired result. As mentioned f* = f Ix = 0 © Thet,e41-¥ Tels [e-1| é 


Since N°* computes . f*, we obtain 


MC(f) 


Cy(N) = 4 + Cy(N*) 
24+ MC(f*) 
2 4 + min(MC(n-1,2+1), MC(n-1;]2-1]) 


(since f*eTy it] U T =] . | 2-1| ) 
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= 4 + min(3(n-1) - max(2+1,1)-6, 3(n-1) - max(|2-1|,1)-6) 
(by induction) 


= 4 + 3(n-1) - max(2+1,|2-1[,1) - 6 
= 3n-£-6 > 3n - max(2.,1) - 6 


and the proof is complete for Case 4. 


Case 5: Both G and H are _ v-gates. 
This case is handled dually to Case 4. 


0 Theorem 5.1 


I 


We now consider a specific set of threshold functions, 1p 


for n 2 2, and demonstrate some known lower bounds on the monotone 
circuit complexity for this set of functions. We first obtain 

lower bounds on the number of v-gates and a-gates separately. 

The general technique used in previous sections of setting 

certain inputs to constants and eliminating gates will be used here 
as well. For the specific function To we will be abte not only to 


establish that a certain number of gates can be eliminated, but also 
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to determine what types of gates can be eliminated. 


5.6. Theorem: (F.F. Yao): If n 2.2. then. MC(T}) = 2n-4. 


Proof’ (Bloniarz): By inductior'on ‘fi. The éase’n = 2 is 

obvious. asa RA CFS, 

For larger values of n, we use an argument similar to that 
used previously. Assume the statement of the theorem | true for a “1, 
and suppose Nis an M-circuit computing 1 Gages ky ’ with | > 
exactly the minimal number MC, (T5) of | ¥-gates. - Assume further 
that N has the minimal total number of gates among all such 
civeutes with MC,(TD) v-gates. We show there is some variable, 
which by symmetry we assume is Xn? such that setting xy to 0 
results in a circuit from which at least two v- gates may be elin- 


ated. If the resulting contracted Gircuit is Ny then N- 


computes a (xpoeeeaX 1) ‘and’ we obtain the inequality 


MCy(T5) = MCY(N) 2 MC,(N“) + 2 


> MC (7-1) +2 (since N? ae 


Why 


and the inductive.assumption completes the proof. 


We use Lemma 5.5 to establish the. results, W;1,0.g. we. may. assume 


-that there is a gate G in N and inputs Xj and x. to N 


n 
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such that i #n, Pred(G) = {X,oX,} and outdeg (x ) 22. See 
Figure 5.10. Clearly by setting x to 0 at least two gates may 
be eliminated. We prove that at least two of these gates are 


v-gates, but first introduce some useful notation. 


Fig. 5.10 


Suppose that Gand H are nodes of N. Recall that a path from 
G to H is a sequence of nodes Gy Gy o-+- 2G {for k 2 0). of 


N such that G, = G, G 


0 GB, =H and G; ,,Succ(G; ) for jie[O:k-1] . 
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Definition: An a-chain from G to H is a path G ala 


aaa 4 

such that G. is an a-gate for all jie[1:k]. That is, an 
a-chain is a path in the graph of N all of whose nodes, except 
possibly the first, are a-gates. Note that a path consisting of a 


single node is an A-chain regardless of the type of node. 


Define 
T(G,N) 


{HeSuce (6,N) | there is an a-chain from G_ to H} 
and 


V(G,N) = {HeSucc (G,N) | H is an v-gate and 


Pred(H) n T(G) # @} . 


(where again we omit mention of N if there is. no ambiguity). 
The importance of these definitions is as follows. 


5.7 Lemma: Suppose N is an arbitrary M-circuit, and G and H 
are nodes in N. Suppose further that Res(G,N) = 0. Then 
(a) If HeT(G), Res(H,N) = 0; 
(b) If HeV(G), then H has at least one constant input 
and may be eliminated from N; 
and 


(c) If HeSuce’ (G) and Res(H,N) # 0, then every path from 


G to H contains a node in V(G). 
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Proof: (a) and (b) are obvious. To prove’ (ce), suppose 
Res (H,N) #0. Then, by ix): there fs no” a-chain from G to H. 
Thus every path. from -G .to. H contains at least one v-gate, and 
the first such is easily seen.to.be'a member of V(G). 
| O Lewma 5.7. 


Me “proceed with the proof of Theorem 5.6. Suppose U is the 
output gate of N, and let -N be the cieculé*obtained when the 
constant 0 is substftuted for “X,+ By Lenma 5.7(b), this setting 
x, = 0. allows us to eliminate all the v-gates in V(x, 38). from | 
N°; if [V(x, sM)[ 2 Z; then thé proof is completet. We therefore. 
need consider only { V(x, ie er a a 

Because uy) depends on x,, we know that UeSucc” (x, .N). 
Since Res(U,N*) is not the constant function 0, we know that 


there is at least’one. gate in VAX,) “py Lemma 5.7(€). 


T Note that T and V_ are defined in terms of the structural 
properties of the circuit; in fact V(x, oN) = V(ZERO,N-), and 
T(x, sN) = T(ZERO,N“). We refer to these sets of gates as V(x.) 


and T(x,) respectively. 
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‘So we may assume that V(x,,): contains exactly one gate H. 
Note that UeSuce’ (HN) since N was-an v-minimal circuit for 
1 Suppose that Res(H,N*) = Q.. Again-usiig Lemma 5.7(¢), we 
know that V(H,N*) # @. -Since the graph of -N-2is acyclic, itis. 
impossible for H to be a member of V(H), so |{H}uV(H)| 22. By 
Lemma 5.7(b), all nodes in {H}uV(H) may be eliminated from N°, 
constituting at least two v-gates. 

We may conclude the proof by demonstrating that. the remaining 
case, namely when V(x,,) = {H} and Res(H.N74) #0, cannot» happen, 
Since H is a gate, we may assume that Pred(H) = {1.J} (where... 
it is possible that G is one of 1d. or H). Since. HeV(x,)s 
at least one. of 1 or J must. be in T(x,)3 wa. assume W. 1.0.9. that 
TeT(x,). The steps of the proof that this case cannot. agcur are 
embodied in the following lemma. 


5.8 Lemma: Suppose G, H, 1, and J are as above. Then 
(a) Every path from x, to U in N must pass through H; 
(b) J does not depend structurally on x, in N; 
(c) G is an a-gate; . 
and 


(4) eT (x,). 


Proof of Lemma: Result (a) follows from the fact that 
Res(U,N~) #0. By Lemma 5.7(c), we then know that every path from 
ZERO to VU in N* must pass through H; that is, every path from 
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Xy to U in N must pass through H. 

Since TeT(x,)5 we know Res(I,N*) = 0 by Lemma 5.7(a). 
Hence Res(J,N*) # 0 since Res{I,N~)vRes(J,N*) = Res(H,N*) # 0. 
Thus by Lemma 5.7(c), any path from x, to J would contain H, . 
the unique gate in V(x,)- Since JePred({H) by definition, and 
N is acyclic, we conclude that there-is no path from Xy to J. 
Hence (b) is established. 

If G were an v-gate, then G. would be equal to H and 
there would be exactly one path consisting of a single arc from 
Xn to H. Since outdeg (x,) 2 2, there is a gate G*°#G in 
Succ(x,). Since there is a path from G* to U in N’, there 
must be a path from G* to H in N. Since both nenber's of . 
Pred(G,N) are input nodes, this is impossible, so (c) is established. 

Finally, since G is an a-gate, V(G)c V(x.) By Lemma 5.7(c), 
we know that V(G) # @ since there is a chain from G to U in 
N* and since Res(G,N*) = 0. Hence HeV(G), $0 by definition at 
least one of I or J_ is a member of T(G). If. JeT(G), then 
JeT(x,) in contradiction to Lemma 5.8(b); hence IeT(G). Since 
T(G) c T(x;), the proof of (d) is complete. 

- O Lemma 5.8. 


Now let N-- be the circuit obtained from N (not N*) by 
setting x, to 0. Res(U,N°°), being the threshold 2 function of 
the remaining n-1 variables {Xy 900 5X) - tx}, depends on Xn 


By Lemma 2.9, we know that there is a path C in N° from Xn 
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to U such that every gate in C depends: functionally on x 
Since H is on every path from x to UW im~N by. Lemma 5.8{a), 
it must appear on-every path: in N** . and hence: on: €. Thus 
Res(H,N“*) depends on x in Nv. = 4 

However, by Lemmas 5.7(a) and 5:8(4}, Res(I1\N“*) = 0, so 
Res(I,N°*) does not depend functionally‘on. x,. -Stmilarly, by 
Lemma 5.8(b), J cannot: depend functionally on ''x, - in’ N, and 
hence in N**, so Res(J,N“*) does not dépend functfonalty on. Xn 
Thus Res(H,N~*) does not depend on Xas'a contradiction. 


Theorem 5.6. 


We can also get a lower bound on the number of 4-gates in 
any M-circuit which computes 1). Here the technfque used is 
similar, but.we must set more than one input to a constant to 


eliminate a single A-gate.°. 
5.9 Theorem: (F.F. Yao): Suppose n 21. Then. MC (TD) 2 fog, n. 


Proof: We prove this fact by strong induction on n. For 
n= 1 the result is trivial. 
Now suppose n> 2° and the statement of the theorem is true 


for all n° <n. Suppose N is. an -A-minimal Mscircuit which © 
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~ computes To Since n22 and since x, 4P1(T5) for any index 
ice[i:n], N must contain at least one a-gate. Let G bea. 
top-most A-gate in N; that is, G is an a-gate such that every 
gate in Pred’ (G) is not an a-gate. Clearly such a gate exists: 


since N is acyclic. Suppose Pred{é) = {H,J}. See Figure 5.11. 


Fig. 5.11 A Top-most a-Gate 
Since G is a top-most a-gate, al] gates in Pred* (G) are 
either v-gates or inputs. Hence Res(H,N) = F Vax, ~ and 
Res(J,N) = vB% for some non-empty subsets A and B of [1:n]. 
Observe that 


Res (GN) = (5 Yin x) © (yey. (x44%5)) (1) 
jeB-A 


Let C denote the smaller of the sets A-B and B-A and let c 
denote |C|. Then one can easily check that c<n/2. By renum- 
bering the inputs , we may assume w.1.0.g. that C = [n-ctl:n]. 

Let N* denote the circuit which eesaiee when the entire set 


of variables {x, | icC} is set to 0. Note that N ~ computes the 
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function ip Ix, = 0 for ieC = ple . By (1) we know that 


Res(GsN“) = 4 9X; 


We now modify N° to eliminate gate G. Construct a new 
tree, of v-gates and input nodes which computes the function: 
ea at agate G’. Eliminate G from N*, and replace G 
by G* in Pred(k) for al} gates Kesuce(@.N° ). (If G were 
the output gate of N, then ‘@ wid] now Be the output gate. ) 

Call the resulting circuit N*~. ‘Singes G has been replaced in 
N“* by a node which computes the same! function, N“* also computes 
a . Since N** has one fewer . negate than N and since N 
had a minimal number of A-gates computing Tp we we know ‘that N°? 


does not. comets ph and thus c21. Hence 


Mc,(T>) = MC,(N) = 1+ MC,(N“*) 
2. 1 + MC ATE) . 
=1+ les) (by induction) 


1+ tog(n/2) as c < n/2) 


V 


“Tog(n) 
and the theorem is proved. 


O Theorem 5.9. 
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In preparation for future results in the chapter, we note the 
following additional facts gleaned from the proof of Theorem 5.9. 
Suppose n is an exact power of 2, and N is an M-circuit for 
10 with exactly log n a-gates (which we later shew is possible). 

If G is a top-most a-gate of N, then if c (as defined in the 
proof of Theorem 5.9) is less than n/2, the setting of the variables 
{x, | ieC} to 0 leaves a circuit N° for a function ne where 

n*> n/2, which has 1 fewer a-gate than N. But this is impossible 


since then 


log n = MC(N) 
2 MC(N’) +1] 


> logn’ +1] 


= Tog (2n*) 
> logn . 


Hence we have proved 


5.10 Corollary: Suppose that n= 2” for reN , and that N is 
an M-circuit for ip with exactly log n a-gates. If G isa 
top-most: a-gate of N= and Pred(G) = {H,J}, then there is a 
partition of [1:n] into two sets A and B_ such that 


[A] = |B] = n/2 and such that Res(H) = AX 


: Wy = V. 
i and . Res.(J) 5eB 5 
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We proceed with our discussion of by demonstrating that 
each of these lower bounds is attainable in some M-circuit comput-— 
ing ie 

An M-circuit-with exactly 2n-4 v-gates for. 1) (for nee 2) 
is obtained by use of the foltowing recursive constructions. We 
will show inductively that, for. n 22, there is a two-output 
M-circuit on inputs (%po-+eeX,} containing 2n-3 v-gates 
which computes To (xyes. oX,) and Ty (xy eK) for n = 2, 


the circuit is given in Figure 5.12. 


Fig. 5.12 A circuit for n=2 


Assuming the fact true for n, we may construct such a circuit » 


for ntl variables with 2 additional v-gates (and an. a-gate) 
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by use of the recurrence relations . 


? n 
i (Xp sXopoe0e Xp sXi 47) = Tobey oe x) v (Xn4] A T(x] 2+ ++ Xp) 
and 


n+] = n 
Ls (X] 90+ + Xn eX 47) = Xae] V Ty (xy 9-5 + 9Xy)- 


See Figure 5.13. 


Fig. 5.13 The Recursive Construction :, 
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A circuit for 1 with exactly 2n-4 v-gates may be thus 
obtained since at the last step in this construction of To there 


is no need for the v-gate marked by * in Figure 5.13. 


5.11 Corollary: For n 22, MC (15) = 2n-4,. 


As for exhibiting an M-circuit for i with exactly _Tog,nt 


a-gates, we first describe a general method for.such a construction 
which was communicated to the author by F.F. Yao and A. Yao. For 
the remainder of this chapter it will be convenient to have the 


variable indices begin with 0 rather 1. ead stares 


Definition: Suppose %-eN . For 0s 2% < 2" and 1 < js 2%, define 


(i), to be the jth digit in the binary expansion of i,' that is, 


1 if ifmod 25) 2 28-1, 


0. otherwise. 


Suppose neN . Define for each je[1: Tog (n) ¥] the following 
subsets of the variables {x, | ie{O:n-1]}. - 


n. aa | ines p Ys AL: 

A. = (x; | teLozn-1] and (4), 0} 

ne “ne 1 = 
and B; {X, | te[O:n-1] and (i), 1} 


-127- 


Define corresponding Boolean functions aze byeB, by» 


n 
(pron) Gea 
n = V 
and Ds (Xpoe + 2X07) BG 


(We omit mention of n if it is clear from context.) Finally, the 


is n ; 
radix-join function F cB, 2 fog W ‘is defined by 


n 4 ; 
Fo = (a+ bys aps barsees 8 fog » og at) 


An M-circuit which computes F" is called a radix-join network. 


It is easy to see the relationship between the radix-join 
function and the function "Threshold 2." In fact, with the 
definition given above. _ — 

flog nt 


tet (a, A bj). (2) 


=~ 
s 
1} 


To prove (2), note that the sets of variables A and B are 


disjoint for each <[1: og nl]. Hence, ‘tine ‘A be the 


function defined by the right hand side of (2), we know that h < To : 


n 
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Conversely, for any pair of variables x and: Xpo with k #2, 
there is some position j at which the binary representation of 

k and & differ. Hence x,XpePI(agab,), 50 a h. This fact 
being true for every k #2 in [0:n-1], we know that es sh. 


See Figure 5.14. 


Fig. 5.14. An M-circuit for rs 


Applying the decomposition (2) to complexities, we have the 


following: 
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5.12 Lemma: Suppose neN , and F" jis the radix-join function. 
Then 
. MC(TD) < MC(F") + 240g nl -1 


and MC, (TS) $s MC (F") + flog of 


Since. F" is defined only in terms of v-gates, MC (F") = 0; 


hence we have: 


5.13 Corollary: For n21, MC (T>) = log n, 


From the formula-definition of the radix-join function one can 
clearly construct a radix-join network with O(n log n) v-gates. 
A straightforward recursive construction yields a radix-join network 
with 3n~- 2 logn-4 v-gates, yielding an M-circuit for ip: 
using Lemma 5.12 with a total of 3n-5 gates. Several substan- _ 
tially different M-circuits for T) with this size-can be constructed, including 
the circuit in Corollary 5.11. Lb. Adleman proved that asymptotically 


fewer than 3n-5 gates are necessary" » and using. his. techniques 


t Personal communication, 1976 
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one can prove the following: 


5.14 Theorem: Suppose neN . Then 


Cryy(F"). s 2n. +.0(¥A). 


Proof: Suppose n>4 and let 2 = flog al. Let’ ny = 2/4 
and ny - h/nj! . We will use the following recursive expansion 
to construct an efficient radix-join network for n variables. Let 
X denote the set of variables {x,'| ie[O:n-1]}. 

Suppose that pe[0:n,-1] and qe[0:np-1). Define subsets 


of the variables X forsuch p and q by Lo = {x, | ie[O:n-1], 


i (mod nj) = p} and He = {xy | icLO:n-1],. ti/nyy=.q}. Informally 


if all variables are thought to have indices whose binary represen- 


tation is padded to length exactly £, then Ly .is the set of 


variables which have a binary index whose lower order 2/2, bits 


are equal to the binary representation.of p. Sinitarlys He is 


the set of variables whose binary index has a high order h/2 
bits equal to the binary representation of q. We have corres pond- 
ing functions L 


and .H. in BL where Ly is the disjunction 


p q 
of all variables in Loe and Ho is the disjunction of all vari- 
ables in Ho: Note that each variable x eX is a member of exactly 


one of the sets {Le | pe[O:n,-1]}5 namely the unique p such 
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that i (mod ny) =p. Similarly, it is easy to verify that the 
collection Hos o-+ + oH, -) 1S a partition of tha set X and that 
* 5 2 - , 


x, eH where q =. i/ny) Moreover, none of the sets Ho is empty 


q 
for ge[0:n5-1] since q°ny < nyno- ny <n and Xqen,SHq" 
We claim that the relationship 
n n No 
= 1 
Fo=(F (Lgslyrersoly aa)> F (Hee +s +H, 27) (3) 


holds, where we use the obvious isomorphism -between {0,132 and 
(0,1}72/2) (9,1 ;2(8/4) 


Assuming this for the moment, we describe an efficient radix- 


join network using (3). Using the definitions of Hp and Loe 


we construct an independent {v} - circuit (which happens to be 
fan-out free and is thus a formula) for each of the functions 


aoa aaa and the functions Hostiy eee e oF 24 - To count 


the number of v-gates which this takes, suppose [bo = i for 


pe[O:n,-1]. Then, since Ly is a disjunction of single variables, 
Ceyp(L) = do 1 since hy # 0.for each p. Hence, the total number of 
is 


v-gates required to compute al] the functions Loskyeseeoly 3 
] 


peLosn-1](%p")) * (HEo:ny-1] tp) =~ 


=n- nm 
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since each x,eX ts a member of exactly one-of the sets: be for some 


pe[O:n,-1]. Similarly, one can compute all the-functions H 


q 
for q<[0:n,-1] with-a total of n- ny: v-gates;. gtving a total 


of 2n - n, - Ny. gates in all. 

We now use the n Functions | tp, £90 -Pelginy cll 25 tnputs 
to a recursively constructed radix-join network FI, and likewise 
the n, functions Hg for qeLO:ng-1] as inputs to a recursively 
constructed radix-join network Foe, Using equation (3), this yields 


a circuit for F", (See Figure 5.15.) — 


Fig. 5.15 The Recursive Radix-join Network 
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If we let T(n) = Cy (F") then we have the recurrence rela- 


tionship 
T(n) < 2n - n) - ny + T(n)) + T(n5). 


Since n, s Yen and ny < vén and since T(4) =4, this recur- 


rence can be bounded by 
T(n) < 2n +. 4(n/2).!/2 + e(njay'44 + 16(n/2)'/8 +... 
(log log (n/2) terms ) 


log log (n/2) 
n+ 


y.. 
=] 


k 
k 


which establishes the theorem since the summation is O(vn). 
s n 
A fg 


It remains to prove equation (3). Suppose that AG Bi a 
and by are as in the definition of F". ‘Suppose that js w/y. 


n : a be 7 
_ = _ ec = UL?’ = roe 
Then A {x, | (i), 0} (p) Solp since (i) 0 iff (P); 0 
where i (mod n,) =p. Hence a? = ,.,\V L; 
1 dG 529. 
n 
responding component al conse MLavegnal 1): 
om? ny sees 
; | 
is the component b;' of F MLorssr thy ats -. 


» $0 ay "is the cor- 


Similarly bj 


A similar proof demonstrates that when | w/A < j <2, ai is 
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No 
the component as w/% of a 2(Hys- soley -1) and bi is the 


component b,2 2/2. 
O Theorem 5.14. 


Applying Lemma 5.12, we have 


5.15 Corollary: (Adleman) Suppose neN. Then 


MC(T>) s 2n + O(Yn). 
We wil? have more to say about this method in Section D. 


Section C - M-circuits for Threshold 2 which are  a-minimal 


SE Re SN 


In ‘the previous section, we demonstrated lower bounds of 2n-4 
and fog, n! on the number of v-gates and n-gates nespece ely 
fisceestry to compute iy bp an N-cireuit, and demonstrated circuits 
which achieved each of these bounds. “The question naturally arises 
of whether there is a single M-circuit which CcOmputes ip and 
which simultaneously achieves this mintmum number of A-gates and 
Nogeres, and in this section we show this is impossible for infinitely 
many values of n. Our technique will be to. establish larger bounds 
on the number of v-gates necessary to compute T (for na 


power of 2) for any M-circuit which. has exactly log n a-gates. 
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We begin by examining the structure of such circuits with 
exactly log n a-gates. In Section B, we saw that such a circuit 
may be constructed from a radix-join network. In this section, we 
will show that it is possible to extract a radix-join network from 
any M-circuit for T with exactly log n a-gates. For now, we 
assume that n= 2" jis an exact power of 2 (for some r > 2), and 
that N is an M-circuit which computes 1 has exactly r a-gates, 


and among all such circuits has a minimal number t: ofi v-gates. 


5.16 Theorem: Suppose n,N, and t are as above. Then an 
M-circuit R, exists which computes F", the radix- join function, 


and which has t - logn+1_ v-gates and no a-gates. 


Proof: We show how to extract R, from N. We first show- 
that there is a restructuring Ny of N which also computes 1p 
in which there is only a single a-gate on any path from an input 
node. to the output gate - that is, there are no a-gates ‘in 
Pred®(G.No) ySuce” (G,Nq) for any a-gate G. Call such a circuit 
a single-level circuit. Moreover, we show that it iespneetblé to 
construct Np witn no additional a-gates or v-gates. 

Suppose that N does not have this property, and let U 
denote the (unique) output gate of N. Let G be a top-most 
A-gate in N which has at least one a-gate below it; that is, 


Pred*(G,N) contains no a-gates and Succ’ (G,N) contains at 
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at least one a-gate. Observe that G#U by definition since 
Succ’(U,N) = 9. We show how to re-wire-the circuit so that G. is 
no longer on a path with-any other. a-gate. 

oe We first construct a circuit N-. as follows.° Add an. additional 
v-gate -U“, and set “Pred{U?,N*) = €6,U}. -In: addition “, for 

every gate HeSucc(G,N), replace @ in Pred(H) by the. constant 
node ZERO. See Figure 5.16. 


Fig. 5.16 The Reconstruction 


Then simplify this circuit to yield a third circuit N-- by 


simplifying any gates with at least one input a constant function. 
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Since outdeg (G,N) => 1, this removes from N~ a1 gates in 
V(ZERO,N*) (as defined in Section 38) and hence at least 1 v-gate, 
so N* has at most t v-gates and at most r a-gates. 

We claim that Res(U’,N~~*) Ts and hence N** has exactly 
t v-gates and r. a-gates. (Note that Res(U*,N**) = Res(U~,N*) ), 
To prove this, assume that @ = (Cy sCoaeeeeCp) is an arbitrary 


constant in {0,1}". We show that 
Res(U*,N*)(€) = T5(€). 


Since N had log n a-gates, by Corollary 5.10 we know that 
there are subsets A and B of [O:n-1] such that AnB = @ and 
Res(G,N) = iAkG%j . Hence ‘Res(G,N) ie Thus, in the case 
j<B ' ee 

that 5) = 0, we know Res(G,N)(€) = 0. In this case it is 
easy to prove by induction on length of paths that, 
Res(K,N)(¢) = Res(K,N~) (2) for every Kdlodes(N). Hence 
Res(G,N-)(¢) v Res(U,N’)(¢) = 0, and thus Res (U’,N-) (ce) =0. On the 
other hand, suppose Res(U~,N*)(¢c) 7 0. Then Res (U,N“)(2) = 
Res(G,N-)(¢) = 0 so Res(G,N)(c) = 0 since N and N* are 
identical on the predecessors of G. Thus, it is again easy to 
verify by induction that Res (K,N) (2) = Res(K,N“)(c) for every node 


KeNodes(N), so Res(U,N)(¢) = 0. Hence 
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(Res(U,N)(€) = 0) + (Res(U-.N-)(€) = 0) 


so Sr = Res(U,N) = Res(U’,N-). 


The effect of this rewiring is to remove G from any path 
which has more than 1 a-gate on it. Since the transformation 
decreases the number of a-gates with «-gate successors’, by recur- 
sively repeating this process for top-level a-gates with a-gates. 
below them, the result is a single-level M-circuit for 1 with 
r <-gates and t v- gates: Call this modified circuit No- We 
remark in passing that every path from an input to the output in 
Ng must have exactly one a-gate on it since P1(To) contains no 


single variables. 


We can now extract the radix-join network from Noe Let U 


denote the output gate of No» let G = {G - 4} denote 


17892 
the set of (incomparable) a-gates in No» and let Pred(G, )= {I, J; } 
for all ie[l:r]. See Figure 5.17. There is a gate in G on every 
path from INPUTS(N) to U. Since No is minimal, there is a path | 
from each node of G to U. Since there are only v-gates on any 
path from G to U, we know that Res(U) = eee. ereevel 


since No contained a minimal number t oe ee 5 we know that 


F 
Observe that for arbitrary nodes A and B of N ,if A 
is a member of Succ (B,N77), then A was a member of Succ (B,N’). 
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the portion of the circuit between the a-gates G and the output 
U consists of a tree of v-gates. Thus there are r-1 v-gates 


H = {H),.-.sH.)} (one of which is U) below G in the circuit. 


Fig. 5.17 The Circuit Ny 


Fix je[l:r] and let jic[O:n-1]. By Corollary 5.10, since No 
has r= log n a-gates, we know that there is a partition of the 
input variables X = {xy |ke[0:n-1]} into two sets A and B 


such that Res(T,) is the disjunction of all variables in A and 
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Res (J) is the disjunction of all variables in B. Hence, 
since the circuit of the predecessors of G consists entirely of 
v-gates, there is a path from the particular variable x. to exactly 
one of I. and J.. 
j j 
Moreover, suppose that x, and x, are two distinct variables 


' n ; n. Y 

of X. Since x-xX-ePI(To)s and since To g %G9;° there must be 

some gate G. such that X;XpePI(G5). Hence there is either a 

path from x to I, and a path from xX to J. or vice versa. 
With these observations, we may renumber the inputs as follows: 


If E is an input node to No» we label E by Xe where we 


define the index ic[O:n-1] for E bitwise by defining 


1 if there is a path from E to I, in Ng 


0 if there is a path from E to af in No 
for each je[l:r]. By the above observations, it is easy to verify 
that, since n= of each input node gets labelled with a variable 
from {x, | ic[O:n-1]} . Moreover, each input node gets labelled 
with a distinct variable since two distinct input nodes must have 
paths to different predecessors of one of the a-gates Gi We 
assume W.1.0.g. that this is the numbering of the inputs since ie 
is a symmetric function. 
Now let Ra be the circuit obtained from No by deleting the 
sets of gates G {all the a-gates of No) and ff (r-1  v-gates). 


Since Ry is an {v}-circuit, we know that, for any gate D in R, 
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\ * : : 
[x,ePI(D)] # [DeSuce (x,)]. Hence, if the radix-join function F" 
has components (a,b) sa5sb,....a,sb).),° then, for each je[l:r], 
a = Res(J54R,) and b; = Res(1,.R,). Hence R is a radix-join 
network for n-variables. 

Since R, contains t-r+1 v-gates, the theorem is proved. 


1 Theorem 5.16. 


Armed with this theorem, we proceed to obtain: bounds on the 
number of v-gates needed to compute the radix-sort function in 


{v}-circuits. We first prove a general Jemma about {v}-circuits. 


Defintion: Suppose N is an M-circuit. Define the center of N, 
denoted Z,,by 2, = {GeGates(N) | Res(G,N) = 0}. Note that 
Zy = unless outdeg(ZERO) 2 1. 


5.17 Lemma: Suppose N is a {v}-Circuit with inputs {x]»X52-+-X_} 
for which outdeg(ZERO,N) = 0 , and some subset A of inputs to 
N is set to 0, yielding a circuit N*. Then it is possible to 


remove from N~ at least 


¥ outdeg(I,N) + J (outdeg(G) - 1) 
leA GeZy/ 


v-gates in Succ*(A,N). 
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Proof: Recall that the operation "setting a variable x to 
0" ina circuit involves replacing x by ZERO in the predecessors 


of all gates in Succ(x). Hence outdeg(ZERO,N~) = } outdeg(I,N) 


IcA 
and it suffices to show that at least 
outdeg(ZERO,N~) + J}  (outdeg(G) - 1) (4) 
GeZy- 


v-gates in Succ” (ZERO ,N*) may be removed from N* . We prove 


this by induction on (Zy-l - 

If Zy- = $, then every gate in S = Succ(ZERO,N’) may be 
eliminated from N~ since each gate in S_ has at least one input 
constant. If |S] < outdeg(ZERO,N~), then by the pigeon-hole 
principle there must be some gate GeS with both nodes jn Fred(G,N*) 
being the node ZERO. But then GcZy.s which is a contradiction. 
Hence, at least outdeg(ZERO,N~) gates may be eliminated. 

Now assume that [Zy-| = k > 0, and that (4) jis true for all 
{v}-circuits with centers of size k-1. Let Gy be a member of 
I(Zy-), the set of initial gates of Zyes and suppose Pred(G) = {J,K}. 
See Figure 5.18. Since Res (Gy .N*) = 0 and Gy is an v-gate, 
we must have Res(J,N~) = Res(K,N*) = 0 by Lemma 2.1. Since J 


and K are not members of Z,- (by definition of G,), it must be 


N 
the case that J = K = ZERO. See Figure 5.19. 


0) 
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We construct a new circuit N from N~ by eliminating gate 
Gos and replacing G ) by ZERO in Pred(H) fer every gate 
HeSucc(Gp .N*). For every node LeNodes(N), we know that 


Res(L,N*) = Res(L,N) since Gp was replaced by a node which 
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computes the same function. Furthermore, |Zy| = [Zy-|- 1 

(since Go was removed from the circuit); outdeg(G,N) = outdeg(G,N~) 
for every gate GeZa; and outdeg(ZERO,N) = 

outdeg(ZERO,N~) - 2 + outdeg(G).N*). Thus, by induction, it is 
possible to remove from N at least an additional 

outdeg(ZERO,N) + ) (outdeg(G,N) - 1) = 


GeZy 


outdeg(ZERO,N“) +[ }  (outdeg(G,N*) - 1)]J- 1 gates from 
GeZ,,- 
N 


Succ” (ZERO,N) is Succ’ (ZERO N°), which, together with the elimination 
of Gp» proves the lemma. 


O Lemma 5.17. 
One additional lemma will prove useful: 


5.18 Lemma: Suppose new, n> 4, and n# 2°41 for any r-e N, 
n 
Let Ry be a minimal {v}-circuit for the radix-join function F . 


If i e[O:n-1], then outdeg(x; .R) > 2, 


Proof: Suppose n, Rae and ji are as in the statement of the 
n = a ga 
lemma. Let F = (ay 2b) 9+ -@ fog WI sb Tog 7 ). Observe that 
[PI(a,)| = Ih/y2! and |PI(b,)| = 0/4 and hence neither PI(a,) 
nor PI(b,) consists of a single variable. Since x is a member 


of one of PI(a,) or PI(b,), we must nave that outdeg(x, »R, ) = Ty 
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Suppose that outdeg(x, .R, ) = 1, and let Suce(x. sR, ) = {G} 


where Pred(G,R, ) = (x, He See Figure 5.20. Since N is a {v}-circuit 


Fig. 5.20 The Case outdeg(x.) = 1 


PI(H,R,) is a non-empty collection of single variables, so let 
x ePI(HR,) be chosen arbitrarily. Note that k # i since the 
graph of R is acyclic. 


It is easy to verify that in any {v}-circuit N, 
[KeSucc (L,N)] => [PI(L,N) ¢ PI(K,N)] ~ (5) 


for any K,leNodes(N). Thus, in this case, since 
Succ’ (x, »R,) = Succ’ (6,R,), we must have xpePI(K sR, ) for any gate 
KeSuce (x, 5R,). However, if we let j be a radix position such 


that Se fae (k) (which must exist since i # k), then one of 
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PI(a;) or PI(b,) should contain x but not “Xs assume W.1.0.g. 
that xjePI(a,) and x éPI(a,). Since R, \ computes FM, there 
must be a node D in Ry which computes aye Since n ? 2°41, : 
no component of Ff (including a;) is the projection of a single 
variable, so D must be an v-gate. Since D depends on x 
and hence is in Succ” (x; +R.) we must nave -mpePI(a,) which is a 


“igontradichions z r oO Lemma 5.18 


We are now ready to establish a lower bound on the complexity 
of any {v}-circuit which computes the radix-join function. 


5.19 Theorem: Suppose neN. Then Cry (F") 2 2n + 2Llogny - 8. 


Proof: The theorem is easily verified for ns 4. For larger 
values of n, we first solve the problem in the case in which n is an 
exact power of 2, say n = 2" for some: re .. -We proceed by 
induction on r; the cases r = 1 and 2 have already been disposed 
of. 

Assume the statement of the theorem true for r- 1, and that 
n a2. | | 

Suppose that F" = (a,sb,,.-+sa,.b,) and that Ris a 
minimal {v}-circuit which computes. FA Qur general objective will 
be to show that by setting half of the variables {x,sX92-.-»X,} 
to 0, one can eliminate at least n +2. v-gates from the resulting 
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circuit and have a circuit R- which computes- pre | Assuming 


this for the moment, we then know that 


n ss n n a 
C. (F)= Cy yy (Ry) zat2+ Cy (R ) 


iV 


n+24 paar 


n + 2 + 2(n/2) + 2(1og (n/2)) - 8 
(by induction) 


4 


2n.+ 2 logn - 8, 


thus proving the theorem. | 

So ‘tipnose Ry is as above, and that I, and dy are the 
nodes in Ry which compute a and by respectively, for 
ke[1:r]. 

By fact (5) of page 145, in Ry each output node in 
{1} Jy>-+- slid} must have outdegree zero since the sets of 
prime implicants of each component of F" are incomparable. In 
particular, if D, and. Do are arbitrary output agdace then 

{PI(D,)nPI(D,)| < n/4. Each output node is an y-gate since each 

| component of F" has n/2>1 prime implicants. These are the 
only nodes in R, with outdegree zero since R, is minimal. 


Let 


A= PI(IR,) = fx; | (i), = 0} 


and 
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wo 
tt 


PI(J3R,) = «x, | (4), = 1 


Observe that AnB 


It 


@ and |A| = |B] = n/2. Let Ra (resp. Re) denote 
the circuit obtained by setting all variables in A (B) to 0. 

It is easy to see that R, , with output nodes {Ty sdpseee ody sdnyts 
is a radix-join network on n/2 variables. Similarly Ry with 

the same set {I,sJjo---st yoda} of output nodes is a radix-join 
network of the remaining variables in 6 if the rth bit of the 
index of each variable in B is changed from 1 to 0. 

We show that nt+2 gates may be eliminated from one of these 
circuits Ry or Rp: In Ras the only gate with outdegree zero 
which computes the constant function 0 is Dai By Lemmas 5.17 
and 5.18, tais means that at least 2+(n/2)-1 = n-1 gates in 
Succ’ (A,R) may be eliminated from Ra (where the only node in 
ae being accounted for is Ls In addition, since 
outdeg(u,»Rq) = 0, Jy also may be eliminated since it is no longer 
an output gate in Ra. Since J,éSucc (A,R,) s it was not eliminated 
previously, so at least n gates may be eliminated from Rae 

In a similar fashion one may count at least n gates eliminated 
from Rp. 

To show that at least two additional gates may be eliminated 
from Ra or Ra we must look at other gates in the circuit. Let 
Pred(I sR.) = {Ky aby} and Pred(J_ sR.) = {Kp slp}. See Figure 5.21, 
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Fig. 5.21 The Predecessors 


We consider several cases: 


Case l(a): Ky and Ly are v-gates with outdegree in R 


more than one. See Figure 5.22. 


Fig. 5.22 Case 1(a) 
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In this case, two additional gates in Ra may be eliminated 
by Lemma 5.17. Both .Ky and Ly are members of Zp, and both 
A. 
are of outdegree at Teast two in Ry: The count above of -n 


gates considered only those members of Zp of outdegree zero. 
A aaa 


Case 1(b): Kp and Lp are both v-gates of outdegree in R, more 


than one. 


If this is the case,then in Rp two additional gates may 
be eliminated; the proof is handled as in 1(a) with A replaced 
by B. 


Case 2(a): Ky and La are both v-gates of outdegree one in Ry. 
See Figure §.23 


Fig. 5.23 Case 2(a) 
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In this case, both Ky and La in addition may be eliminated 
from Re: We have previously eliminated from. Re the unique gate 
Ls in Succ(K, »R,) = Succ(LysR,). Since neither K, nor La 
is an output gate in Rp» each may also be eliminated from Ree 


Case 2(b): Kz and Le are both v-gates of outdegree one in Ry 


Proved similarly to 2(a). 


Case 3(a): Cases 1 and 2 do not hold; one of {Ky skp} is an v-gate of out- 
‘degree more than one in Ris and one of {Kp Lp} is an 
v-gate of outdegree one in R,. (The other nodes might 


possibly be inputs.) See Figure 5.24. 


Fig. 5.24 Case 3(a) -- One Possibiltty 
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In this case two additional gates v-gates may be eliminated 


from Ra- Zp has at least..one gate of outdegree-at Teast two 
A 


since both K, and Ly “are members of Zp, » So Lemma 5.17 allows 
us to eliminate at least one addittonal gate. The v-gate of 
outdegree one in {K, Lp} may also be eliminated since its unique 
successor J, had been previously eliminated and since neithér 


Kp and Lp is an output gate in Ra- 


Case 3(b): Same as Case 3(a) with A and B_ interchanged. 
Proof is similar. 


Case 4: One of {Ky sla} is an input, the other is -an .. v-gate of 
outdegree one; one of {Kpsl,} is an input, and the 


other is an v-gate of outdegree one. 


Suppose w.1}.0.g. that Ka and Ke are inputs. Let 
Pred(L,»R,) = {M,N}. See Figure 5.25. At least one of M or N must 


Fig. 5.25 Case 4 
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be an v-gate, for if both M and N were inputs, then 
[PI(T sR) = 3; but |PI(a,)| =-(n/2) > 4 so this is impossible. 
Suppose w.1.0.g. that M jis an v-gate. . 


If outdeg(M,R, ) 2 2, then MeZp » SO in Ry one additional . 
A : 


gate in Succ’ (M,R,) may be eliminated by Lemma 5.17. Also Lp 
may be eliminated since it is not an output gate and its unique 
successor has been eliminated. On the other hand, if outdeg(M,R, ) 
= 1, then in R, both Ly and M may be eliminated in turn since 
each of their successors has been eliminated and neither is an 
output gate. 

The above list of cases exhausts all possible arrangements of 
Kas Ly» Kes and Lp; the other arrangements are not possible in a 
minimal {v}-circuit. For example, it is impossible for Ky to be 
an input and La to be an v-gate of outdegree more than 1. If 
so then since |PI(I,sR,)| =n/2, |PI(LpsR,)| = (n/2) - 1. 
However, the set of prime implicants of any ‘Gutput other than I, 
has at most n/4 < (n/2) -1 variables in common with PI(IR,)s 
and hence. the only output in Succ’ (Ly ,R,) is I, by fact (5) of 
oe 145. Similarly, it is not possible for both Ky and La 
to be inputs. 

This completes the proof of the theorem in the event nis 


an exact power of 2. 
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If n is not an exact power of 2, let r= jlog and 
let Ny = 2". Suppose that R, is a mintmal ‘{v}-circuit for . 

FD = (a) sby s+ +2847 ob iyy) and let I, and J, be nodes in R, 
computing ay and by respectively (for ke[Tirtt]}). Let B be 
the set of variables {x, | ie[ng:n-1}}. and observe that — 

PI(b i.) =B. If R* is the circuit obtained by setting all 
variables in B to0O, then R°, with outputs MI, Jy o---sT adit > 
“ds a radix-join network for no variables. : ) 

If n# Ny + 1, then by. Lemma 5.18 each input. xX, 6B has 
outdegree at Teast two in R,: Hence by Lemma 5.17 at least 
2|B|-1 gates in Succ’ (B,R,) may be eliminated from R’ (since 
Jit, Zp- is of outdegree zero). In addition, Tae which is an. 
v-gate since [PI(a.y)| > 1, may also be eliminated from R* 
since it is an v-gate of outdegree zero which is not an output of 
Ry . Since |B] =n - No» this means that a total of at least 2(n é No) 
gates may be eliminated from R*. 

If n=ngt, then PI(b,,) = {x,_,}- Since 
Xf PI(a,) and {PI(a,)| > 1, we know that outdeg(x . R,) 21. 


Since N is minimal, J is the input node es The setting 


r+] 


of x,_, to 0 results in a circuit R° from which at least one 
gate in Suce"(x,_15R,) may be eliminated (since there are no. 
gates of outdegree zero in Zp-). Moreover, Iy is again a gate 
and may be eliminated from R* in addition to the gate previously 


mentioned, giving a total of two gates which may be eliminated from R7. 
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In either case, R* may be simplified to be a radix-join network 
R** on no variables with the elimination. of 2(n-np) v-gates. 
Thus 
oe: | 7 
Co yy fF ye Coy3 (Rp) . Co yy (R ) + 2(n No) 
n 
0\: oot A 
2 C(F ) : a(n Ny) 
2 2no +2 log'ny - 8+ 2(n-no) (since 
Ng is a power of 2) 


2n + 2r - 8 


and the theorem is proven. 
Oo Theorem 5.19. 


5.20 Corollary: Suppose that n = 2”, reN , r 22-and N is an 


M-circuit for 1 with exactly log n a-gates. Then 


Cy(N) 2 2nt+4 logn - 9. 


[ocesiannaenneanllll eensemesimmeninanenenemetemeiateiomel 


section D - Efficient Circuits for Threshold Functions 


As mentioned, the proof of theorem 5.14 is a modification of 


a method first discovered by L. Adleman for computing threshold 
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functions. A description of this method, which proves that 
Mc(T?) < knt6(n) for arbitrary . fixed k, is to appear, and we sketch the 
technique briefly in this section for “threshold 2”. 

To compute Ts the n variables are arranged in a 
(/nl x Ynl = square matrix X. Let R, be the disjunction of the 
variables of the i row of X, and let C; be the disjunction 


th 


of the variables of the j~ column. Then observe that 


m1 » Ta |: 
Toucan et ss (Ry see Rea )v ae (Css pa) 


since each pair of different variables in the matrix differ either 
in their row numbering or their column numbering. © This construction 


give rise to the recurrence relations 
r 
MC(TS) s2n- 2 +1 weuc(T/ )+1 


and Mc(TS) = | 


since the row and column sums can be constructed with at most 
2n - 2f¥n 14+ 1 gates. This method shows that 
Mc(TD) < 2n + avn + O(n'/4), 

One problem with the above method is that it-does not necessarily 
yield an M-circuit for TD with the -minimum number. ‘log, of 
a-gates. For example, for n = 8 the resulting circuit has 


4 a-gates. The circuit for F" described in Theorem 5.14 uses 
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the same idea except. that the variables are arranged into a 
rectangle with sides which are an exact power of 2. While the 
method of Theorem 5.14 yields an M-circuit with exactly Nog nl 
a-gates, it does so at the expence of increasing. slightly the: .. 
total number of gates. A close examination of the recurrence 
relationship for Ce (F") shows that, tn the notation of the 
proof of Theorem 5.14, either n, < Yn and No s Yon sor 

n, s ¥2n and ny s Yn. Hence this method: yietds an M-ctrcuit 

for 1 with at most 2n+(1+V2) Vii + Gtn!/4) “\gites. - (For some | 
values of n. the recurrence fs ‘better;) © The table below describes 
the best known constructions for some small values of n.- 


Ae 


Cor. 5.20 Lower Construction of . Adleman's 
‘ Bound if log n Thms. 5.12.and 5.14 Construction 
: A-gates EEA ae tae oi 
(Total Gates) (Total) (a-Gates) 
| ae 71 
4 7 2 
8 19 4 
16 39 4 
32 79 6 
64 151 6 
128 290 8 


Table 5.1 The Complexity of ™ 
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Observe that. for n = 128, the most efficient. known M-circuit 
for i does not use the minimal. number. (tog 128.) of ‘awgates. 


Section E. Open Questions — 


There are a host of open questions raised by the above 
results: . 
(1) We know that Coy (Fo) s.2n + O(n). Determine 
Coy, (F") more exactly. 
(2) Does an optimal M-circuit for 13: shave exactly fog nl. 
a-gates? | . 
(3) Is there a minimal M-circuit for ip which is single-level? 
Is there one for an arbitrary quadratic function? 
(4) Is there an M-circuit of size c+n-logn for Ty: for 
arbitrary k <n where c is independent of k? Is 
there an M-circuit of size <og k + Th for Ty for 
fixed k? : * 4 . 
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CHAPTER 6 
The Complexity of Monotone Functions fin Other Bases 


The objective of this chapter is to study whether the complexity 
of a monotone Boolean function can be reduced by using a basis other 
than {a,v} in the formulas or circuits considered. That this is 
the case for Boolean circuits and multi-output :functions is eeTieeioa 
and several examples exist. For instance, Paterson [1975] and Mehlhorn 
and Galil [1976], extending work originally done by Pratt[1974], have 
shown that the M-circuit complexity of the Boolean matrix 
multiplication of two nxn matrices is exactly 2n® ~n*. ' On the 
other hand, Fischer and Meyer [1971] have shown that Strassen's fast 
integer matrix multiplication algorithm can be used to multiply two 


Boolean nxn matrices on a Turing Machine in time  o(n?-81 ++ 


). Using 
Fischer and Pippenger's results connecting time-bounded Turing Machine 
computations and circuits, this means that a circuit with o(n2-82) gates . 
for matrix multiplication exists in any complete basis’. “Paul [1976] 
has improved this gap by exhibiting a series. of monotone. functions 


nin 


feB for which MC(f) = 9(n7/ (1ogn)3/2) but for which Cp (f) 
? 
= O(n log? n). 


T Recall that the complexity over different complete bases is always 
related by a constant factor, hence it does not matter which complete 


basis is used. 
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It remains an open question as to whether such a gap between 
B, ‘and MC exists for a single-output function. Pippenger 
[1976] has shown that if one looks at the monotone function in B, 
with the worst MC complexity among all m.b.f.'s in 8, P then its 
MC complexity is no more than a factor of 0(10g n) larger than 
its CB complexity‘. On the other hand, several researchers have 
conjectured that a large, even exponential gap may exist between the 


MC and Mp measures for a specific monotone Boolean function. 


In contrast to this, it is possible to-show that there are 
specific m.b.f.'s for which allowing a complete basis in the circuit 
does not allow a saving of more than a constant :factor over the 
monotone case. This clearly the case for a function of minimal 
M-complexity which depends on all its arguments:, and hence has. 
M-circuit complexity of n-1. The situation can occur for more 
complex functions in addition. For example, in Chapter 3, we 
demonstrated that most quadratic m.b.f.'s of n- variables have 
B circuit complexity of a(n@/1og n). Since each quadratic 


m.b.f. can be realized in an M-circuit which has 0(n@/10g n) 


* In fact, Pippenger conjectures that among all monotone functions 
the worst case values for MC and for Ce are asymptotic, and 


offers a reward of $100 for proof of this conjecture. 


4 
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gates, it is not possible to save more-than a. constant factor. in 
circuit size for most quadratic functions. _ 

We consider the formula size of functions in the éaieitiidek of 
this chapter. Here, the choice of complete basis will have a 
greater effect on complexity as pointed out by the work of Pratt 
[1975] - we consider the bases N, By, and U. One may make comments, 
about the worst-case values for these measures for all monotone 
Boolean functions similar to tose Noleed for circuit Hiaslextties: 
it is the complexity relationship: for individual functions which 
concerns us here. We show that. this relationship can depend on 
the functions involved. Clearly, there are. functions of complexity 
n-1 in all three measures for which, na savings can be had.. In work 
done jointly with M. Paterson, we do show..that there are m.b.f.'s 
of M-formula complexity . e(no/log n) for which any formula in any 
complete basis must be of size “‘R(ny/1og n), and hence for which 
not more than a constant factor savings may be achieved. On the 
other hand, we exhibit a particular sequence of m.b.f.'s which 
have M-formula complexity a(n2), but for which linear sized formulas 
exist over the basis By. This sequence is derived as a corollary 
of Khrapchenko's result that. any U-formula, for the parity function 
must have size n2, While tidssexauole affirns that the complete basis B 
may allow more succinct expressions for some m.b.f.'s, it does not 
answer the question of whether the addition of negation to the basis 


M allows a more compact expression for some functions. This latter 


question remains open. 


2 
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Section A. A Case where a Complete Basis: Doesn't Help 


We will utilize a method introduced’ by Nefiporuk [1966] to 
establish a lower bound on the formilla size of a particular mono- 
tone function. This method has been used by other authors in ref- 
erence to other problems [Harper and Savage 1972, Paul 1977, and 
Lamagna 1975]. While the technique applies to formulas over arbit- 


rary bases, we restrict our treatment here to the binary basis Bo. 


Notation: Suppose . FeB. ts an arbitrary Boolean function of the 
variables X,,....X,, and suppose - Y etx, | ie{1:nJ} is some 
subset of variables. Let .S(f,¥) denote the set of all restric- 
tions of f to Y obtained-by setting al7 remaining variables not 
in Y to be constant; that {s, | 


: - ne Y 
S(f,Y) a {f | x= c, for x¢¥ [€,€{0.1} | | }. 


6.1 Lenma: Suppose f,Y, and S are as.above. If F is any Bo- 
forma for f, and F contains k > 2 occurrences of Y variables, 


then 


‘[s(f.¥)| < 168! 


Proof: We prove this fact by induction on k. Suppose f,Y,S, 


and k are as in the theorem. 


In the event k=2, let x, and x; be the. Y-variables occurring 


in F (where possibly i = j}. Since no other Y-variables besides 
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X; and x; appear in F, all restrictions f Fie Sor aay 
of f to Y are functions of the variables {X45 oX 5} - Since there 


are at most 16 such functions, this proves the lemma in this case. 


Now suppose that k > 2, and ie’ Tenia is true for all smaller 
values of k. In the tree corresponding to F, there exists a 
subtree of F with-some number 2 of occurrences of Y variables, 
where 2 <2 < k-1. This is true since the sons of a node in a 
binary tree create a partition of the leaves above the node. 

' By passing upward? from the root of F, one may find ‘such a subtree 
since each leaf contains at most 1 occurrence of a Y-variable. 


Thus we can decompose the formula F as 
F(xqs-++9X,) = G(H (xy 5-44 5X) sXye00 9X, ) (1) 


where H has 2 occurrences of Y-vartables and G has k-& occur- 
rences of Y-variables. Let g(Z,x)sX5s.--X,) and h(x, sx 205 +X,) 
be Boolean fucntions defined by G and H respectively. 

We may bound the number of restrictions of f to Y by bounding 
the number of restrictions of g and h. Let Y° denote the set 
of variables Yu{z}. Then G _ has exactly k-z+1 < k occurrences 


of Y~* variables. By (1), since any restriction of f to Y 


In a formula, as in all circuits, we think of the constant and input 
nodes as being at the top of the formula and arcs directed 


downward. 
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satisfies 


) 


f | . ao g(h | 4 GoXy Ky reee Ky We ra 
or xéY¥ 


for xéY . for x4¥ 
there can be no more restrictions of f to Y than the product 


[S(KLY)| + |S(g,¥7)|. 
By induction, this implies that 
IS(#,Y)| < 1627! . ig hk-BFHD-T 4 ggkel 


and the proof is complete. O Lemma 6.1 


6.2 Theorem (Netiporuk): Suppose .feB, is a Boolean function and 
Pj »Pos--+sP, is a partition of the variables {xy | de[Tin]} such 


that f depends on some variable in P; for each ie[I:r]. | Then 


r 
Lo (f 1/4 Jogi S(f,P. 
B, fF) 2i/ th og| ( PEL 


Proof: Suppose F is a minimal Bo- formula for f. For each 
ie{1:r], let k, be the number of occurrences of P,-variables in 


-] 
F. If k; 2 2, then [S(#,P,)| < 161 by Lemma 6.1, so 
k;-1 = log ¢|S(f.P;)| = 1/4 logy |S(f.P,)|. If k, = 1, then 


[S(#,P,)| < 4, so k, = 1/4 log [S(#.P5)|5 the case k, = 0 cannot 
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happen since f depends on some variable in Pi. In any event, 
k, 2 1/4 log |S(f,P;)|. Since in (F) = Fk » this proves the 
i i . By qe 1 


theorem, O Theorem 6.2 


We may now use this theorem to establish lower bounds on the 
complexity of | particular m.b.f.'s. Note that if ns = |p. | for 
ie[I:r], then [S(fP;)| < gi since this is the number of Boolean 
functions of n,; variables, and IS(f42.) Is 2"-n; since this is the 
number of settings of variables outside P;. Choosing ny ~ log n for 


each i gives a partition of the variables into ~ n/log n sets with a 
possible maximum value of log|S(f5P,)| of n-log n, to yield a 


theoretically possible a(n@/10g n) lower bound using Theorem 6.2. 
It is possible to show that tha maximal lower bound obtainable from 
the theorem is a(n@/log n). 

Using an example similar to Netiporuk's original function, we 


establish the following example: 


6.3 Theorem (Paterson, Bloniarz): For every n, there is an m.b.f. 


eB, such that 


(A) Ly (f) = a(n*/log n) 
and 


(B) ML(f) = O(n*/log n), 
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so that allowing a complete basis in a formuta for f does not 


allow more than a constant factor reduction in size. — 


. m 
Proof: Suppose for the moment that n = (hve) where m 
is an odd integer. Note that m divides n. To define f, we 
regard the n input variables as arranged in a (n/m)xm rectangu- 


lar matrix 5° For each ic[1:n/m] and je[1:m], we will fix 


Fig. 6.1 The Inputs 


m 
94; to be a distinct subset of [1:m] of exactly (ime1)/2 
elements (which is possible by our choice of n). Define 
f:{0,1}" + {0,1} by 


+ V 
FXG) gett ensmy DGy 4 ¢ Nt env) (00 jPk@))) (2) 
Jje[1:m] k#i 
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Intuitively, we may think of a a; "sas row patterns: ~- 


ij 
(mt+1)/2 entries which will be chasen from .a row of m variables. 


If ke[1:n/m] and Ng | is true, we, will say that row. k 


qe0; ; 
satisfies pattern lige Note that this implies that row k has at 


least (m+1)/2 ones in it. The laude 


Ke[1in/m] qe, Xkq) 
k #1 oP 


th 


is one if. some row other than the i satisfies roe Finally, — 


#(X; 5) = 1 if there is some variable Xap “equal td | ‘| whose cor-" 


responding pattern OF is satisfied by some row other than the 


jth. 


j 
We note in passing that f is monotone by Theorem 2.4. 
To obtain a lower bound using Theorem 4.2, consider the rows 


P. = {x Xj | je[1 mJ) for ie[1l:n/m] to be a partition of the input 


variables. We obtain a lower Bbdne on Is(f, p jl. W.1.0.g. 
assume that i = 13; we show that a large number of settings of une 
remaining variables. yield different restricted functions of f 


to -P). In particular, suppose c;. for iel2: n/m), jef1:m] - 


ij 
are constants such that each row of constants has fewer than 
(m+1)/2 ones in it (that is, [{J ie = 1} < “(m#1)/2 for each 
ic[2:n/m]). We show that two different such settings of the | 
variables Xi = &4j 
restricted functions of the first row. 


for ie[2:n/m], je[1:m] yield different 
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Let fz denote the restriction Oe. : for ie[2: n/m], 
Xi 


=¢ 
ij 

je[T:m]. The first row is the only possfble row which can satisfy 

any pattern since each other row has fewer than (mt) /2 ones in 


it. Hence the inner conjunction is false for “k # l, “and ae 


VY A_A xX 
FE defasn/mj 15” 9€055 “14? 
je[1:m] 


tf #C* are two distinct such constants, w.1.0.g. let 7 


Now if 
and j. be indices such that Ci = } and. C73" Q. Define -an 


input for the first row. by 


1 if geo, j | 
%q = : for each qe[1:m] 


0 1f afo,, 


and note that the row X1q satisfies pattern 94, and no other. 
5; sy = ot a 
Hence F(X q) 1 whereas 3 (x4q) 0, and the two functions 
fe and fz. are different on the input X19" 
Thus |S(f,P,)| is at least the number of such settings of the 
remaining rows with fewer than (m+1)/2 ones per row. Since there 


m-1 


are 2 ‘length m piney vectors with fewer than (me)/2 ones , 


we obtain 
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[S(#,P,)|-2 Fn = gh-(n/m)-m4l 


‘Hence, by theorem 2, 


Pee ay a 
2 : N-(n/mM)-Mect 
By i eae 


= 1/4 [(n@/m) - (n2/m2) - n + (n/m)]. 


By Stirling's formula, [Knuth 1968, p. 46], 
m 
n= ((ns1)/2) ~ /27im 2" 


so m~ log n. Hence Lg, {f) = a(n2/m) = a(n2/1og n) and the lower 
bound is established. _ 

The upper bound on ML(f) is obtajned by expanding and fac- 
toring the definition (2) to get a more compact formula for f. 
Since f’ is not explicitly defined, we prove that an M-formula of 
size 0(n2/10g n) exists without actually exhibiting it. Clearly, 
f has an M-formula of size o(n2), namely (2). 

To get a smaller formula for f, we can distribute the variables 


in (2) to get the disjunctive form 


Ste . vo. 
F(X) = a eetinsm] ke(Tin/m] %i5 * des, *kq) - (3) 
je[1:m] k #7 J 
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If we define, for ke[I:n/m], - 


ty) VU 
15 Tim] Berea onc cee 
P: 

then by reversing the indexing in (3) we obtain 


afm 
f(x) = V #, (x) 
- k=t 
n/m . 
so ML(f) s a ML(f,). To bound ML(f, ) for a fixed ke[1:n/m], 


note that each monome in PI(f,) consists of the distinct product 
of (m+l)/2 variables from row k (which has m variables in all) 
and a single variable not in this set. Since the number of prime 
implicants of f, far exceeds m, by factoring (4) on the vari- 


ables in row k we obtain a smaller formula for ne 
6.4 Lemma: Suppose r,SeN, and GeB eg | is a function such that 
y= Ve (ya A xa) 
G(X aoe aX aVyoeee Ve) = y, A AX 
7 1 r’"} s ja! jeA, j 


where,.for each i, A; is a unique subset of size a of ‘[l:r]. 


Then 


mg) s (3!) + (7) - 1. 
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Proof of Lemma: We proceed by induction on r. If r= 0, then since 
there is a’unique subset of size 0 of 9, g(y,) is either y, 
or the constant function 0. In either case, ML{g) <1 = (I) + (3) - 1. 
| If r>Q-and a=r, then clearly G(X] 9+ ++ 2Xpo¥y) = 4 
YpAXA---AX, or g = 0, SO ME(g) é reve (MH 4(K-1, 
Similarly, if a= 0, then again -ML(g) <1 = (1) + (5) - 1. 
Now suppose N<a<r.. By factoring the definition of g by Xn 


we may express 


g =(x.-9,)¥ 95 (5) 
where 
aN | 
and | 


eee ee es 
99 = Ge[1is]-T% * ™ x3) 
1 


and T is the set of indices ie[1:s] ‘such that —TeA;. - Note 
that each subset A; \ {r} for ieT has a-l members chosen 


from [l:r-1], and hence inductively 


mi) « G4) + CD =. 


Similarly, each set A, for if¢T has a members chosen from [1:r-1], 


and hence 


~172- 
ML(g5) s (a) + (nl) ee 


Thus, by. the expression (5), 


ML(g) s 1+ nig) + oy 


514 (7) $4, ‘). -14(f)4 (nl) 4 (by 
q induction) 


7 a + (2) -"1 (usng the fact that eG) = Go) Hes yy 
[Knuth 1968, | p. 54)). 
O Lemma 6.4. 


Now, sinceLemma 6.4applies to f,(%) with r=m and a= (m+1)/2, 


we obtain 


ML(f,) < at + ((ms1}/2)) = 1 
= ( (net) /2) + a“ (oni) 
=3n- 1. 


mn 2 2 
Hence ML(f) < i (3n-1) = 3n°/m-- n/m = O(n*/Tog n). 


To finish the proof, for n not of the form (13/2) (m odd), 
we could choose Ng of this form with (n/4) < Ng sn and aeely 
ignore all but the first Ng of the n arguments. 

) Theorem 6.3 
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ee ems tes eens 


In a counterpoint to the previcus case, we present an example 
where the larger complete basis B, can allow a reduction in formula 
size of more than a constant factor. The example is a monotone 


cover of the parity function. 


Definition: Suppose FeB., is an arbitrary Boolean function. An_ 
m.b.f. 98a is said to be a monotone cover of f if 


ore 


: F(x) 5-. Xd) = = 9(Xj+7%> Gates > “xX ae 


Example:  1f PMX) 2 aXe) = xP Xp + ® Ky is the parity. function, 
then g” defined by | 


n n 
n = n \ ; ‘ VY ‘ 
g (Xy9¥ Xp Var er + 9% ry) [P pee yyy teeny) (6) 


is easily verified to be a monotone cover of Pp", 
We claim that the function g" exhibits the properties desired, 
We use the following result of. Khrapchenko [1971] which we state 


without proof. 


6.5 Theorem: If neN, then 


n2 < Ly(P") < ane . 
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The fact that L ces) < ané ee nd 
dividing Pp” (Xp +ooX,) as (ye. . @X n/a )@(x 0/3 412-5: OX). 
The lower bound is established by a ‘remarkable argument wirich uses 
the isomorphism between series- sparale! contact. networks and U- 
formulas. | 


Assuming Theorem 6.5, it is straightforward to show that 


ti < ML(g") eae + 4n, 


To derive the upper bound, suppose that F is a minimal U-formula 
for Pp", By. applying DeMorgan: Ss laws, we may distribute all nega-~ 
tions in F to the variables without. changing the size of the for- 
mula, sO we assume w.1.0.g. that F has this property. Let Fr. 
be the formuta obtained by replacing all occurrences of “K, in 

F by the literal Y (for all ic{1: n]). A straightforward | 


argument demonstrates that 
n n 
CFO 8 gDy Oxpyyg Ivy yy Oyayy) 


is an M-formula for QP (xy s¥y 00+-X ol)» 


For the lower bound on ML(g"), suppose that F(x Yor 9X Vy) 
is any M-formula for g”. By replacing Ys. by Ty in F for 
each jic[1:n], we obtain a U-formula for P", which by Theorem 6.5 
must have né occurrences of literals. . Thus -F must have had 


size n°. 
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Since P” has a Bo formula of size n-l, Le (g") < 5n-1. 
2 


Since g" has) 2n variables, we have proven 


6.6 Theorem(Bloniarz, Meyer): For every ne there is an m.b.f. feB 


such that 


ML(f) = o(n2) 
and 
Lp (f) < 5n/2 + 0(1), 
2 
Note: Because of Theorem 6.5, any U-formula for g” 


must also have at least n@ occurrences of variables. Thus this 
example does not answer the question of whether the addition of 


negation to the basis B allows more compact expressions for m.b.f.'s. 


Section C. Open Questions 


(1) Is there any single-output m.b.f. for which 
Ly(f) < ML(f); for which Cp (f) < MC(f) ? 
2 
(2) Is there a larger cap than that given in Theorem 6.6 
between ML and Lp ? 
2 
(3) Do larger gaps between these measures exist for multi-output 


functions? 
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CHAPTER 7 


A Shortest-Path Algorithm 


This final chapter is concerned with the prebltem of finding 
the shortest distance matrix for a non-negatively weighted directed 
graph. Several algorithms to-solve this problem have been proposed 
[Dijkstra 1959, Floyd ;1962, Spira 1973, Fredman 1976], ‘the best . 
worst-case running time being 0(n3(1ogteg-n) !//¢10g -n) !/3) 
[Fredman.1976}. «However, Spira's algorithm, which has a worst-case 
time of a(n? log n), has. average running time On? (leg a)*).: 

Spira's algorithm-basically searches for the:nodes ctosest to ~ 
a given source node. By searching along edges: of minimum weight first, 
it is likely that. the shortest paths from the source node to all 
other nodes will be discovered before-all:arcs.in the graph need to 
be traversed. - In fact, starting from any given source, on the 
average. only O(n-log n) arcs needto- be traversed. By repeating - 
this process for all n source nodes, the shortest path matrix can — 
be found. 

In Appendix 3 we show that several Tacunae remain in Spira's 
algorithm. In particular, Spira does not specify any search pattern 
when there are paths from the source of agual “Tesigtti.- We observe 
that arbitrary choices among paths of equal length do not solve the 
problem efficiently; inappropriate "tie-breaking" rules can result 
in algorithms which have a(n?) running time on almost all 


matrices. One example of this phenomenom.is presented in Appendix 3. 
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Our main result (this work was:done jointly with A. Meyer 
and M. Fischer) is a correct version of Spira's algorithm, which 
we prove does indeed run in 0(n(10g n)?) average time for a 
broad class of probability distributtons.on dtrected graphs with 
non-negative weights. These distributions properly -include the 
distributions for which Spira originally ‘made his:claims. Infor>. 
mally the condition characterizing distributfons for which our 
results hold is that information about the entire graph except 
for the arcs emanating from any given node. i, together with~ 
information about the weights of the arcs emanating from node i >» 
is not correlated. with the assignment of the weights to those 
arcs -- all permutations of assignments are equally likely. 

Finally, a modification of thts algorithm is noted which 
computes the transitive closure of a Boolean matrix in average 
time O(n? log n). This result has recently been ‘improved by 
Schnorr [1978a]who exhibits an o(n?) average -time algorithm for 
this problem. | 


Section A. Shortest Paths and Graph Distributions 


Directed ataphe were defined in Chapter 2. We henceforth 
assume that every n-node graph has a set of vertices V = [I:n]. 
A graph (V,E) is weighted if there is a cost function 
c:E +R u{-»,~} which assigns a weight or cost to each edge in the 


* 
graph; the graph is non-negatively weighted if range (c) c¢ R . 
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We consider only weighted graphs of this latter type and refer to 
them as weighted graphs. 

One standard representation of a-weighted graph is the nxn 
cost-matrix C, where C(i,j) is defined to be the cost of edge 
(1,5) if edge (i,j)cE, and Cli,j). 2° if (#,g)¢ET. One 


additional representation will also be used. In the sorted list 


of edges representation, an n-node graph G is represented as a | 
sequence of n adjacency lists. Each list element is a pair (aw), 
where a is a vertex of G and w is a weight in R . The jth 


list 
L. = ((a,, Wi) lao Map ber (Big May) 


represents the k, edges emanating in G from node i (including . 


possibly an edge from ji to itself); a is the endpoint of the 


ij 
arc and Wig is its corresponding weight. In addition, we 

require that the weights in each adjacency list be arranged in 

| = : 

increasing order Wip S Wig S eee Ss Win In the case where 

a eg ee 

t We do not distinguish between missing edges and edges of weight o, 


The definition of < and + on R’ are extended to R_ in 


the usual manner. 
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there are multiple edges in. the adjacency list of the same weight, 
we assume that the endpoints are placed in the-list in some 
nhesceibed order; for definiteness we: assume that they are in: 
order of increasing node index. * | 

We will represent the adjacency lists as two. nxn matrices - 


the endpoint matrix. A and the edgecost matrix W, where 


and 

na if ks k; 

W(i,k) = 
eo if k>k,- 
(NIL is a special symbol ), Figure 7.1 exhibits different represen- 
tations of a particular weighted graph. : . 
Suppose p =(VysVoo--09¥p) is a path in G, that is, a 

sequence of k 21 nodes such that (V5 2¥j 47 )eE for ie[1:k-1]. 


The cost of path p is the sum of the costs of its edges; that is, 


“Tetra where the cost of the trivial path (v) from v_ to 
v is 0. The minimum cost matrix M of G is the nxn matrix 
which has M(i,j) equal to the minimum of the costs of all paths 
in G from i to j. The shortest distance problem is that of 
computing the minimum cost matrix M given the cost-matrix C of 


a weighted graph. 


+ this assumption is actually unnecessary for either the correctness or 
the timing of our algorithm. 
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5 5 1 
© -) 0 
2 * 1] 
Cost-Matrix. 
A: | 1 2 3 OW: 5 5 10 
3 NIL NIL : jo «@ -» 


3 1 NIL fl. 2 0 


~ Sorted List of Edges” 
Fig. 7.1 Representations of a Graph 


6 will denote the set of all. n-node weighted graphs. The 
subscript n will usually be omitted. Through the cost-matrix 
representation of graphs, we will identify 6 with cr *y?? F 
A map 1:G+6 isa row permutation if it simply permutes the 
endpoints of the edges emanating from a specified node while leaving 
the remainder of the graph intact; formally, there must be a permu- 
tation p of [l:n] and an index igelt sn] such that, if G@ is any graph 


in G with cost-matrix C, and 
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C. is the cost matrix of 1(G), then 


Cli,j) if i#i 
c,(4.4) = | : 
| C(4,°(5)) if i = ig 


for every (i,i)e(1in}?. The set of row permutations generates a 
group T, of transformations on G, each member of TF, as called 
an adjacency transformation. | 

If P is a probability measure on G, its distribution function 


2 
is the function Fo:L RPS txer | Osx <1} defined by 


F p(X) = P{ CeG}e(i,3) sj, for (is)eL1 en] 


where X is any element O45 | CijdeLI: ny’ ) of [ R #2 
Every probability disertbution is uniquely characterized by its 
distribution function. 

We will say that a probability measure P on G is adjacency 
invariant (A.E.) if any adjacency transformation eT, isa 
measure preserving transformation om .G;. that is, Fo (m(x)) = Fo(x) 
for all xe[ Ry". We will consider specific examples of contin- 
uous A.I. probability measures later in this chapter. For a 
discrete probability measure on. G, note'that the measure is A.I. 


if and only if the probability of any graph is equal to the 
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probability of the graph obtained by permuting the endpoints of - 
the edges emanating from an arbitrary node in the graph. 
For any function f:G+Rand any probability measure P on G, 


let ED(f) denote the expected value of fF with respect to P. 


Section B. The Algorithm 


Before describing the algorithm, we make an observation about 
weighted graphs which serves as basis for the algorithm. Suppose 
that G is a weighted directed graph, and that i is an arbitrary 
"source" node of G. Define, for any node jeNODES(G), D(j) — 
to be the minimum of the costs of all paths in G from i to j, where 
D(j) is = if there is no path from i to j. Let NEAR bea 
subset of the nodes of G such that ieNEAR; those nodes in NEAR 
are called near nodes, and all other nodes are called farchodes: 
For each near node j, we define REALK(j) to be the ‘far node 
k of G such that C(j,k), the cost of the edge from j to k, is 
minimal among the costs of any edge from j toa far node. If j 
has several edges of equal minimal cost emanating from it, then 
REALK(j) is arbitrarily selected to be one of them. -REALK(j) - 
is undefined if j has no arcs to any far node. 


We make the following observation. 


7.1 Lemma: Suppose that G is a directed graph, and that NEAR 
satisfies the property that for every jeNEAR and kéNEAR, 


D(j) < D(k). Let ig be a near node such that the sum 
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D(ig) + Clips REALK(jp))is minimal among all near nodes, and let 


ky = REALK(jy). Then 


(1) D(ky) = D(jq) + ClIgsk9), 
and (2) D(ky) < D(k) for every KkéNEAR. | 

Proof: Part(1). Since D(ip) is the cost of some path from 
i to Jo: we know that D(5p) + C(5goky) is the cost of a path 
from i to ko» and hence D(ky) < D(jq) + Cig kg). Suppose to. 
the contrary that D(ky) < D(5g) + ClIgskg). and let 
T= VpsVooeeesVe = ky bea path of weight Dig foe i to kp 
Observe that 2 21 since feNEAR. Let Lo be the least index in 


s a’. Thus 


[1:2] such that v, ¢NEAR; observe that 1 < Lo 
0 wa 


oy eNEAR, and hence 
Ly7] 


Div, _,) + C(v, _,.REALK(v, 1) < D(v, ,) +C(v, 4...) (by ‘def. of REALK). 
tal Lyn ual del to 1? "hg Y 
hyo] 
s 2 C(v avi) (since vys--.s¥ 


5 | 
m1 0 


isa path to Yel 
2-1 
s ayo me net) 
= D(ky) 
< DlIg) + C(¥yrky). 
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But this latter inequality contradicts our choice of Jo and 


part (1) is proven. 


Part (2). Suppose to the contrary that k is a far node such 
that D(k) < D(ky). Let i = Wy Worse sWy =k be a path of 
length D(k) from i to k. Again choosing Xp to be the least 
index in [1:2] such that W, ENEAR. we must have 1< 2%) s 2 


and w eNEAR. Thus 


dg-] 


Diw, _,) + C(w, _, REALK(w, _,)) < D(w, _,) + C(w, -4, Ww, 
&y-l dal, hn Xgl Xo 1 Xi) 


lA 


2 CW Wie ) 


D(k) 


D(k 


A 


i 


which is again a contradiction to our choice of jg: 


Oo Lemma 7.1 


By the above argument, if NEAR satisfies the hypotheses 
of Lemma 7.1, then so does NEARU{ jg}. | 
We may use Lemna 7.1 as basis for an algorithm for constructing 


the minimum cost matrix M for a graph G. We first present a 


-186- 


version of the algorithm which uses certain simple operations on 
finite sets. We later note how these set operations can be imple- 
mented in terms of standard operations and data structures on random- 
access computers. | = - 

We assume that the graph is presented in its sorted list of 
edges representation as matrices A and W. Extraction of A and 
W from the cost matrix C may be performed by sorting the rows of 
C ina stable fashion* using edge weights as keys. The algorithm 
constructs the minimum cost matrix M one row at a time; this cor- 
responds to finding the shortest distance from one particular node, 
say node i, to all other nodes in the graph. The algorithm 
searches from node i in a manner dictated by the weights of the 
edges; shorter paths are searched first. 

In the algorithm, NEAR and D are as described in the hypoth- 
eses of Lemma 7.1 with one exception - D(j) is only defined for 
near nodes. If j is a far node, then D(J) = NIL. Initially 
NEAR is set to {i} and D(i) to 0. The algorithm halts with 
NEAR equal to the set of nodes which are reachable by paths from i. 


T a-sort is stable if the relative order of elements with keys of 
equal weightis preserved at the end of the sort (cf. Knuth [1973 
Ch. 5 J). 
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Inductively, to add a new node to NEAR, the algorithm attempts 
to find a node Jo satisfying the hypothesis of Lemma 7.1. A 
priority queue is used to direct this search (see [Aho, Hopcroft, 
and Ullman 1974] for definition). In the course of. execution of 
this algorithm, certain edges on a near node's adjacency list will 
be "examined"; initially no edges in the ‘graph have been examined. 
Inductively we assume that all edges previousty examined point 
to near nodes, and that.all examined edges on an adjacency — 
list will consist of those edges of least cost. A pointer f(j) 
is maintained for each node. j in [1:n]; its value is the index 
in matrices A and W of the next (teast-cost} unexamined .edge 
on j's adjacency list. P(j) is initially set to 1 for all nodes. 

Each near node j has an additional value associated with it, 
which we will call KEY(j). For these nodes, KEY(3) is the sum 
of D(j) and the weight of the next unexamined edge from node j; 
that is, KEY(j) = DCJ) + W(j,P(j)}).  Sinee the examined edges on 
j's adjacency list all point to near nodes, we know that 
KEY(j) < D(j) + C(j, REALK(j)). 

To add a new vertex to NEAR, the algorithm selects a near node 
j such that KEY(j) is minimal among all near nodes. Suppose 
that k = A(j,P(j)) (the endpoint of the least-cost unexamined edge 
from node j) is a far node. Then j= Jo satisfies. the conditions 


of Lemma 7.1 since 
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D(j) + C(jsk) = KEY(§).< KEY(5~) 
< D(j-) + C(i-, REALK(S~)) 


for any other near node j°, and since C{j,k) ts the least cost of any 
edge from j to a far node( all examined edges: point: to néar nodes). 
Hence k can be added to NEAR and °D(k) can be set to KEY(3) 
and still preserve the inductive hypotheses ‘of Lemma 7.1. 
On the other hand, if node k were already a member of NEAR, 
then its shortest distance is already known, so nething fs done. 
In either case, since the edge from j to>-k « A(j,P(j)) has been 
examined, the pointer P(j) +s moved to the next-edge from = j. 
This search is continued until either al} nodes. are found to 
be in NEAR or until there are no further edges to examine, in 
which case it follows from: Lemma-7.1 that NEAR-4s ‘the set of all - 
vertices which are reachable by paths from i(since a missing edge has weight ). 
One additional requirement is imposed-in order that the general 
algorithm given above have fast running time. If the node j in 
NEAR from which we are searching has several edges of equal weight, 
then these edges are considered in consecutive executions of the 
éearch loop. That is, if W(j,P(j)+1) = W(j,P(3)), then we require 
that the algorithm select j as the near node from which to search in 


the next execution of the search loop’. 


ui Together with the specifications on the tie-breaking rules of the 
priority queue below, this requirement avoids the difficulties in 


Spira's algorithm noted in Appendix 3. 
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To select nodes with appropriate KEY values , we use a priority 
queue. Three priority queue operations are necessary. CLEARQ 
sets the queue to the empty state. MINQ returns and removes from 
the queue a pair (KEY(5) ,3) such that .j is a node in NEAR for 
which KEY(j) is a minimum among all nodes in NEAR . If. the queue 
is empty, MINQ returns the pair (NIL,NIL). INSERTQ (d,j) inserts 
node j into the priority queue and sets KEY(j) to..d.. In case 
there are several nodes in the queue with minimal KEY values, 

MINQ selects and returns any one of them. (We assume only that the 
queue operates so that the value returned by a MINQ operation 
depends solely on the previous sequence of queue operations.) 

The fully specified algorithm, Ris which computes the shortest 
distance D(j) from node ji to node j, is given below in a 


PASCAL-like language, where » is represented. by NIL: 


FOR 


FOR L 
NEAR : 
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Algorithm R, 
L := 1 ton DO D(L) := NIL; 
=lton 00 P(L) := 1; 
= (1H 5 DC) 2= 0; J t= NILs CLEARQ ; 


IF W(i,1) # NIL THEN <KEY,J> := (i,1),1>5 
Comment If there is an edge emanating from i, then KEY is set 


to the minimum weight of all edges from i. 


WHILE J # NIL DO 


Comment J-is the near node to be searched from. If REALK is 


defined for some near node, then J # NIL, 


BEGIN REPEAT K := A(J,P(J)); 


END | 


Comment K is a candidate for REALK(J). 
IF KéNEAR THEN | | 
Comment K is REALK(J). 
BEGIN NEAR := NEAR u {K}3 
D(K) := KEY; 
IF |NEAR| = n THEN HALT; 
IF W(K,1) # NIL THEN INSERTQ (- KEY#H(K, A), a 
END; 
P(J) := P(J) +1 
UNTIL = W(J,P(0)) # W(J,P(J) - 1); 
Comment Examine all equal-weight edges emanating from J 
at the same time. a : 
IF W(J,P(J)) # NIL THEN INSERTQ (D(J) + W(J,(P(J)) J) 
<KEY,J> := MINQ: 
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To compute the shortest-distance matrix M, algorithm R. is 


run for each ie[1:n]. 


Shortest-distance algorithm 


FOR. 2 229 toon: v0: 


FOR J := 1 ton DO M(I,d) := D(J) 
END . 
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Section C. Analysis of the Algorithm 

If G is a weighted graph, we will denote by N.(G) the 
number of times the test "IF K#NEAR" jis executed by R, on 
graph G; that is, N.(G) is the number of edges of G examined t 
by algorithm R.. Note that only a fixed number of operations are 
perforined by R. between successive executions of this test, so 
that the total number of operations performed by R. on graph G 


is proportional to N.(G) + 1. Thus we try to estimate N.(G). 


7.2 Theorem: For any ie[l:n] and any adjacency invariant 


probability measure P on G; 
Ep(N.) <n log, n 


Proof: Suppose that ie[1:n] and that P is an A.I.  prob- 
ability measure on G. If MT is any adjacency transformation, then 
I is a measure preserving transformation on G. Hence E,(N,) = 
Ep(N. elf) ). Taking the sum of such expectations over all adjacency 
transformations Il, we have 


) } 


aN 2 t)) a nef ‘Eph 7 11) ) 


1 


Ept, 


Peel 


n 


7 |-E(N) 


Formally, an edge from g is "examined" when the pointer p(y) 


iS set to the next entry past the edge in the adjacency matrix. 
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thus, to prove that E(N,) sn log, n, it suffices to show that 


E 
pt i, 


Y (N. ° m/{T1) <n Tog, n. 
et, $ay: S 


We will prove the stronger statement that, for any graph GcG, 


[a Mca 17, $ nloq.n «7 


So suppose that GeG is fixed, and let G, be the set 
{1(G) | eT}. An easy argument demonstrates ‘that thé left-hand 
side of inequality (1) is simply the average value of N, with 
respect to the probability measure PL which assigns each graph 
in G, equal probability, and all‘other graphs probability zero. Hence 


we must show that 


Ep (Ny) s n 10ge n. (2) 


Proving inequality (2) requires analyzing the average 
behavior of algorithm Ry over all inputs GcG,. A convenient way 
to carry out this analysis is to consider a probabilistic algorithm 
Re which has no inputs. RF is the same as R; except that where 
R, would reference inputs describing a graph 4G, Re generates the 


inputs randomly. Note that every gran in G, has the same 
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edge-weight matrix W as G; it is only the endpoint matrices A 
which. differ among the members of G)- Moreover, each possible 
endpoint matrix is equally likely, modulo the requirement that 
edges of equal weight in a row be Siranced th saievene tig: wiode order. 
Hence, the probabilistic algorithm Rj whiten we construct 
selects at random and with equal probabilities one of ‘these 
possible endpoint matrices and then executes 1 on the graph chosen. 
The selection of the endpoint matrix -A -is not: Gore: inittally but 
rather dynamically as Ry references A (which is done in left- 
to-right fashion across the rows of A). At the first reference 
by R; to entry A(J,K) for any J and K in Tien], a value 
of the endpoint A(J,K) is chosen. If J has several edges of 
equal weight emanating from it, then at the first reference to. 
any of these a selection of the endpoints for,all these edges of 
equal weight is made and entered in the matrix A in the proper 
order. In all cases, by selecting these endpoints from all possible 
unused endpoints fn an independent fashien with equal probabilities, 
one guarantees that every graph in G, will be chosen with equal 
probability. | 
Re is obtained by replacing every reference ACJ,K) in R. 
by the following probabilistic procedure A‘(),k). In this latter 
procedure, C fs an internal nxn matrix which A’ uses to etare 
the endpoints which have already been selected; C(J,K) is 
initially set to NIL for every J and K in [1:n]. is 


the number of endpoints to be selected, and . SET is a set variable 
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which collects those endpoints. RANDOM is a probabilistic 
procedure which returns an element of [lin] independently and 


with equal probability. MIN returns the minimal value of any set. 
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Procedure A‘ 


A’(Jsk): IF C(J,K) # NIL THEN RETURN (C(J,K)) 
ELSE | Oe 
BEGIN 


L := |{K°e[T:n] | W(J,K) = W(J,K~)}1; Comment The number of edges 
SET := 95 to be selected. 


REPEAT. 

B := RANDOM ; 

IF BA{C(J,K*) | Kee[1:K-1]} 

THEN SET := SETu{B} . 
UNTIL |SET| = L, 
Comment SET is now a random selection of L un-used 
endpoints of edges from J. 
For Jv =J to (J +L - 1) DO 
BEGIN 

R := MIN(SET); 

SET := SET-{R}, 

C(U°,K) :=°R 


END; 
Comment These L edges are inserted into the graph 


weight W(d.K) in the proper order. 
RETURN (C(J,K)) 


END; 
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We make the following obvious remark. 


7.3 Lemma: Let E, be the average number of times the test "if 
KZNEAR" is executed over all possible runnings of algorithm R;- 
Then Ep = Ep Ny) 

To analyze the probabilistic algorithm Rj » note that once a 
node becomes a member of NEARuSET, it will become a member of NEAR 
before the next call of procedure MINQ. Fhis is true since 
endpoints of equal-weight edges from J are considered in succes~- 
sive passes of the loop of R:. Hence, if NEARuSET = [1sn] at 
any point in the algorithm, then at most n further executions of 
the test "IF KéNEAR" will occur before R; terminates. 

Now let E be the expected number of times the procedure 
“RANDOM is ‘called in the execution of RF lint]  NEARUSET = [iin]. 
At any point in a particular execution of Ris the number of times 
the test "IF KéNEAR" has been executed is at most equal to the number 


of times nodes have been selected by RANDOM Since the endpoint of 
every examined edge wes at some point returned by RANDOM. Hence 


Ep (Wy) = ys E+. 


But E is nothing but the expected number of times nodes from 
the set [1:n] must be chosen randomly with repetitions and placed 
in the set NEARUSET until NEARUSET = [1:n]. This number is well 
known to be asymptotic with n log, n [Feller 1968, p. 225] so 
Ep Ny) < 1 log,n. 


{1} Theorem 7.2. 
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section D. Implementation 


There are several efficient ways of implementing a priority 
queue on a random access machine (R.A.M.) [Aho, Hopcroft, and 
Ullman 1974]. In particular, one can use a heap and ensure that 
every execution of MINQ and INSERTQ takes 
O(log n) steps. and every CLEARQ takes O(n) steps. Standard 
implementations allow computing membership in, and cardinality of, 
the set NEAR with fixed cost. Under these implementation assumptions 
the average number of instructive steps requfred to execute R. (for any 
ie[1: n)) on a R.A.M. is O(n 10g” n) for any A. I. probability ‘on G. 
Since the shortest distance problem is that of computing the 
shortest distance matrix M from the cost matrix C, we must 
include the cost of extracting matrices A and W from C. This 
can be done by sorting at a cost of O(n? 10g n) basic steps, and 


we have the following theorem. 


7.4 Theorem: Implemented as above, the shortest distance algorithm 
take an average of O(n? log” n) basic steps on a R.A.M. over any 


-A.I. probability measure on weighted graphs. 


For may applications, one wants not only compute the mini- 
mum distance between all pairs of nodes i and j, but also to 
find a pati in the graph which achieves that minimum cost. A 
simple addition to algorithm R; enables one to retrieve such a 
path. A on®- dimensional array PATH is introduced and PATH({z) is 


initializesc :o NIL for every gell:n]. When a far node k is 
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added to NEAR by virtue of its being at the endpoint of an arc 


from the near node j, then PATH(k). is set to j. This fully 


specified version of algorithm R, is given below: 


FOR L 
FOR L 


FOR L : 


NEAR 


PATH(i) = 
IF W(i,1) # NIL THEN <.KEY,d> := <W(i,1),i>3 
WHILE J # NIL po -_ 


1 to n DO PATH(L) := NIL 


Al orithm P 


1 ton DO D(L) := NIL; 


1 ton DO P(L) := 1, 


:= {i}; D(i) 


BEGIN REPEAT K := 


= 0; J := NIL; CLEARQ; 


a2) PCO); 


IE KéNEAR THEN 
BEGIN D(K) := KEY; 


END; 
P(J) 


PATH(K) := J; 

NEAR := NEARU{K}; 

IF [NEAR| = n° THEN HALT; 

IF W(K,1) # NIL THEN INSERTQ( KEY + W(K,1),K) 


:= P(J)+] 


UNTIL W(J,P(J)) # W(J,P(3) - Ws 


IF W(J,P(J)) # NIL THEN ERD) + WJ, P(J)) J); 


<KEY,J> := 


MINQ. 


The paths from all source nodes may be stored in the obvious way. 
A straight-forward inductive proof shows the following: 


Suppose keNEAR at some stage in algorithm P Then if we define — 


i: 
the sequence of nodes: 


Wy =k 
WwW, = PATH(W, ,) for 2217 in N, 


and if Ly is the least index in WN such that W, = j, then 


1=w W _poeeeaW ww =k 
Ly Loy ] ae | 


is a path of cost D(k) from i to k.. 


A final application allows us to modify the algorithm to 
compute the transitive closure of an (un-weighted) directed graph. 
* 
If G is such a graph, the transitive closure matrix I (G) jis an 


nxn -Boolean matrix defined so. that 


1 if there is a path from i to jin G 


0 otherwise . 


1"(6)(i,4) = | 


The transitive closure problem is that of computing 1"(6) from 


the nxn incidence matrix I(G), where 


1 dif there is an arc from i. to j in G 


1(6)(i,3) = | 


, O otherwise. 
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We may identi fy an unweighted directed graph G = (V,E) with 
the weighted directed graph G~ obtained by assigning weights of 
zero to all edges in £ and weight = to:all missing edges '. 

We make the observation that the shortest distance matrix entry 
M(i,j) is equal to 0 iff a e) = T, 

Hence; any shortest path algorithm may be used to solve the 
_ transitive closure problem. For algorithms R; and''P., since 
all KEY's are equal to zero, one may imptement the priority queue 
in such a way that every insertion and deletion from the queue 
can be performed with constant cost. As pointed out in a previous 
paper [Bloniarz, Fischer, and. Meyer 1976] this results in a 


transitive closure algorithm with o(n2 log n) average time. 
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Section'E. A.I. Distributions 


We conclude by presenting several examples of adjacency 
invariant probability measures. 

1. For each ic[1:n], suppose P, is a-prohability measure 
on R. Let P be the probability measure on G obtained: by. 
selecting each entry of the weight matrix C(i,j) independently 
according to distribution P.. Then P, being the product measure 
generated by the measures Py» is A.I. Thi class properly. - 
includes all distributions claimed in Spira [1973]. 

2. Suppose R is some probability measure on an arbitrary 
set B, and F:G,, +B is a function which is. invariant under each. 
eT; that is, f o n(G) = f(G) for all ter, and: GG. Then 
the induced probability measure Ro f on Gis A.J. by construc- 
tion. Some specific examples include cases in which the weights 
of edges might be specified but all choices of endpoints are 
equally likely. For example, in the discrete case, one might 
specify a distribution on the sum of the weights of the edges 
leaving each node, and specify that each graph with the same sums 
be equally likely. Or one might specify a distribution on the 
maximum weight of any edge in the graph and specify that each graph 


with the. same maximum be equally likely. 
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Section F, Open Questions 


1. Schnorr [1978a ] has recently exhibited an 0(n2) average- 
time algorithm for transitive closure which improves on the ideas 
above. His algorithm does not construct the paths between all 
pairs of nodes; does a transitive closure algorithm exist which 
does construct the paths (as the algorithm P. does) and which has 
o(n@ log n) average time? 

2. Is there a larger lower bound to the worst-case running time 
_Of any shortest-path algorithm than the Qn?) obtained by adversary 
arguments? Can a lower bound on the average-case running time of 
such algorithms be established? For a related paper, see [Yao, Avis, 
and Rivest 1977, Graham, Yao, and Yao 1978, and Yao and Rivest 1978], 

3. The fastest known algorithm for computing the transitive - 


2.81..) basic steps in the worst case [Fischer 


closure requires O(n 
and Meyer 1971]. Can this algorithm be improved upon?t Can Fredman's 
(worst-case 0(n°)) shortest-distance algorithm be improved upon? 

(Fredman [1975] has observed that Spira's algorithm may be 
modified to compute the shortest-distance matrix M with a average total 
number of comparisons of only O(n? log n) but at a significant increase 


in total running time when all operations are considered, ) 


t+ Recently, Pan [1978] has improved on this result and has 
announced (1979) an even further improvement. 
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APPENDIX 1 


In this appendix we sketch a proof pointed out to the author 
by V. Chvatal’ of the lower bound on: H(n) stated in Theorem 3.14. 


We use notation as in Chapter 3.. 


Theorem: Suppose O0<e<1/8, Then, for arbitarily Jarge n, there 
is an n-node undirected graph G_ such that D(G_) > nen'l-e) 47, 


Hence #(n) 2 none) 47, 


Proof Sketch: Suppose O<e<1/8 We will show the existence of an 
n-node undirected graph Ge (for arbitrarily large on) for which 
D(G_) = n-n’-€+1 . First we make some observations. . 
Suppose that G jis an arbitrary undirected graph, and let 
cs (hy >... sh.) be an exact cover of G by complete bipartite 
graphs G)s--.G.. Suppose G, is a complete ke-2, bipartite 
graph for ic[1:m]. Cbserve that if beth k. and &. are at 
least 2 for some ic[{1:m], then G must contain a 4-cycle; that 
is, there must be nodes a,b,c, and d in G such that edges 
{a,b}, {b,c}, {c,d}, {d,a} are in G. Hence if G were a graph 
with no 4-cycles, we may without loss of generality assume that 


each graph G, isa(1,£,) complete bipartite graph. Let v, 


private communication, 1977. Recent research by Ber nond and Chung 


{Bermond 1978] have shown that 


non 9/24 + 6, H(n) <n- log,n + 0(1) 


for arbitrary 6>0. 
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be the node in h.(G,) corresponding. to the singleton set in G.. 

It is clear that iv, | ie[1:m]} is a node cover of G, 
that is, a subset of the vertices cf & such that every edge in 
G contains at least one member of. the. set. Hence, D(G) equals 
the size of the smallest node cover of. G, as. Jong as. & contains 
no 4-cycles. 

If; O<e<1/8 4s arbitrary, then Erdos [1959] has proved the. 
existence of an arbitrarily large graph G_ with n nodes which 
has no 4-cycles, and for which every set of at least nie. 
spans at least one edge. Hence any node cover for G. can omit 
at most n!-€.} nodes, $o b(G ) > n-n' 841, 


D theorem: 


nodes 
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APPENDIX 2 


In this appendix we explore several questions in the M-circuit 
complexity of combinations of functions. A general theorem, due to 
Galbiati and Fischer, is presented in which a combination of finct ions 
f and g is considered in which f and g depend on one variable 
in common’. This general proof includes as a special case M. Fischer's 
result that MC(fxg) = Mc(f) + Mc(q) [Paul 1976]. Finally, we 
remark that these results also apply to the measures NC and MC, - 
Additional research on combinations of functions which are the dis- 
junction of variables has been reported by Lamagna [1975], Neciporuk 


[1971], and Tarjan [1976] . 


1. Theorem (Galbiati_and Fischer): 


Suppose n,m,k, and ge, and feB 


m.b.f.'s. 


ok and ge«B are 


n+] m+] ,2 


Define FageB inst, ke by 
FAG(X) 2000 oXq Vy 002+ YqoZ) = (F(X] 90+ 9Xq0Z) G(Y] 90+ + eZ) 


Then MC(fag) = MC(f) + MC(g). ~ 


Proof: Clearly MC(fag) < MC(f) + MC(g); we prove the reverse 


ae | 
This result has been presented in [Galbiah and Fischer, 1978] since - 


the writing of this paper, 
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inequality. 

Suppose N .is a minimal M-circuit me bois raat fag. We 
will Say a gate GeGates(N) is a mixed gate if & depends struc- 
turally | on some variable from x = {Xy s+. “My ‘4 and some variable 
from Y = inssedays }. We show that it is possible to re-structure 
N to eliminate all mixed gates and result in a circuit N° which 
has no more gates than N and which still computes fag. If this 
is the case, then N consists of two disjoint circuits; one consists 
of Succ (X,N)u{z} and contains a gate computing (i, z), and the 
other consists of Succ *(y, N yutz} and contains a gate which computes 
g(¥:z). Hence the total ‘number of gates in N° (and therefore N) 
is at least MC(f) + MC(g). . 

So let MIX = {GeGates(N)|]G is a mixed gate} and suppose 
MIX #9. Then I(MIX) #9, so we can select an arbitrary gate 
GeI(MIX) and Jet Pred(G) = {H,J}. Since N is minimal, neither 
H nor J is a constant node. We will replace G with another 
gate which is not mixed and yet still have.a circuit which: computes 
fag. | | 

Since H and J are not in MIX, but G& is, either H depends 
solely on Aut?) and J depends on Yu{z}, or vice versa; suppose 
wW.1l.0.g. that H depends on Xu{z} and J depends on Yu{z}. 


We first consider the case that G is an a-gate. In this case 


PI(G,N) ¢ PI(H,N)-PI(J.N) 5 (1). 
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that is, every prime implicant of G is a product of a prime impli- 
cant of H and a prime implicant of J. By Lemma 2.8 , since no 
component of fsg has a prime jmplicant which contains both a var- 
iable from X and one from Y, we may eliminate from PI(G,N) all 
such monomes which contain both a variable from xX and a variable 
from Y, and still have a circuit which computes fag. We. consider 


seveal cases. 


Case 1: Neither PI(H,N) nor PI(J,N) contains the monome z. 


In this case, every monome in PI(H,N) contains an X-variable 
and every monome in PI(J,N) contains a Y-variable. Hence, by (1), 
every monome in PI(G,N) contains both an X-variable and a 
Y-variable so G may be replaced by a node which computes the 
function V@, the constant function 0. Since this new network has 
one fewer gate (namely G) than N, this contradicts N's minimality 


and hence Case 1 can not occur. 


Case 2: z is a member of both PI(H,N) and PI(J,N). 


In this case, zePI(G,N). By an argument similar to Case 1, 
one can also show that all monomes in PI(G,N) different from © z 
contain both. an X-variable and a Y-variable and hence may be 
eliminated from PI(G,N) while still leaving a circuit which 
computes fag. Hence G may be eliminated and‘ replaced by the 


input node for the variable z, a contradiction ‘to the minimality of N. 


~210- 


Case 3: z is a member of exactly one of PI(H,N}: or PI(d,M). - 


- Assume w.1.0.g. that zePI(H,N). In this case, every monome | 
of PI(H,N) other than z contains an X-varfable, and every 
monome of PI(J,N) contains a Y-variable. Thus every sponse of 
PI(G) contains a Y-variable, and those which do not contain an 
X-variable in addition are of the form z-t’, where t“ePI(J) 

(note that t may also contain the literal z). -. 

So let N° be the circuit obtained by replacing H in Pred(G,N) 
by the input node for variable z. Then: PI(G,N*) consists of those 
monomes of PI(G,N) which do not contain both an X+variable and 
a Y-variable, and we have a circuit .N~ which stil]? computes 
fag by Lemma 2.8. Since G is not mixed in N*, we have reduced 


the number of mixed gates in N by one. 


In the event that G is an v-gate, we use an argument dual 
to the one above to also replace G by a non-mixed: gate. Recur- 
sively repeating the. above construction for each initial mixed gate, 
we obtain a modified M-circuit for fag with no mixed gates, and no 
more total number of gates than the original. Hence the theorem is 


proven. O Theorem 1. 


2. Corollary (Fischer): Suppose that feB, k and 9B, og are 
m.b.f's. Then MC(fxg) = MC(f) + MC(g), and any optimal circuit 


for fxg contains no mixed gates. 
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Proof: Define F°(Xy 900+ 9X02). = F(xys---0%,) and 
9°(Yyae++ sVez) = G(Yys+++ Yn). Then MC(f~) = MC(f) and 
MC(g*) = MC(g): Also fag’ = fxg, so the first part of the 
corollary holds. Moreover since fag’ does not depend on variable 
z, cases 2 and 3 of the proof of Theorem 1 can never hold for a 
minimal circuit, and hence no mixed gates can occur. 


DB Corollary 2. 


In a similar fashion to Theorem 1, one also proves that 


the following holds. 
3. Theorem: Suppose that feB and geB are m.b.f.'s. Then 
—— n,k m2 


(1) MC (fxg) = MC (Ff) * MC, (g) 
and | 


(2) MC)(fxg) = MC\(F) + MC (g). 


The proof of part (1) is similar to that of Theorem 1; 
one considers an a-minimal circuit for fxg which among all such 
also has a minimal number of v-gates. part (2) holds by duality. 


A similar result holds to Theorem 3 holds for fag. 


Note: We observe that the analogous question to Theorems 1] and 2 


when f and g_ have 2 variables in common is false as the example 
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f(z) 5Z) = g(Z, 525) = 2) A 


demonstrates . 
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_ APPENDIX 3 


In this appendix we present a version of Spira’ Ss “algorithm 
for computing shortest distances in a graph [1973] which has 
a(n? ) average running time over a certain class of Srapns: 
This counter-example, which was previously reported in [Bloniarz, 
Fischer, and Meyer 1976], reveals why we needed to revise, correct, 
and verify Spira's original approach. | 

Spira's algorithm operates by computing the distance from a 
particular "source" node ji to all other nodes in the graph. By 
repeating the algorithm for each source node i, the shortest- 
distance matrix may be found. Two differences between this algor- 
ithm, which we call Sis and the algorithm R. of Chapter 7 are 
noted. The first is that, in Sj» no assumptions are made about 
the value returned by MINQ in the case in which there are nodes 
in the queue with equal KEY‘'s; the queue might utilize information 
about the graph in breaking ties. This is a minor difficulty. 
The other difference is that when a near node has several edges 
of equal weight emanating from it, then these edges are not nec- 
essarily examined in successive passes of the algorithm. These in- 
complete specifications lead to the problems pe cdestiibe later. A 


Pascal-like implementation of S; is given below: 
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ALGORITHM. S,, (Spira's Algorithm) 


FOR L : 
FOR L 


1 ton DO D(L) := NIL; 
1 ton dO P(L) = 13) 
NEAR = (433 D(i) i= 05 J := NILs CLEARQ; 
IE W(i,1) # NIL THEN <KEY,u> = W(4,1),4>5 
WHILE J # NIL po a. 
BEGIN K := A(J,P(J)); 
IF KéNEAR THEN : 
BEGIN NEAR := NEARUCK} | 
D(k) z= KEY; 
IF [NEAR] =n THEN HALT; 
IF W(k, ) } NIL THEN INSERT KEY + w(K, 1), k) 


1 
_ 


END ; 
P(J) := P(J) + 1; 
IF W(J,P(J)) # NIL THEN INSERTQ( KEY + H(K,1) 5K); 
<KEY,J> = MINQ” ae | 
END 


-The proof that Ss; correctly computes the shortest distance 
D(j) from node i to node 5 is identical with that given in 
Chapter 7. for algorithm Re. | 

To show that algorithm 8; can be implemented poorly, for 
simplicity we will restrict ourselves to the case in which all 


edge weights are either 0 or -~. This corresponds to computing 
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the transitive closure of a graph as mentioned in Chapter 7. 
In this case, every node in the priority queue has a KEY of zero, 
and the value returned by MINQ jis an arbitrary pair in the queue. 

We present an example in which the: priority queue is implemented 
so that the value returned by MINQ does not utilize any information 
about the graph (i.e. the matrices A and W) other than that 
_ given it by previous INSERTQ operations. In particular, we maintain a 
first-in, first-out queue to store the nodes in the priority queue. A node 
is INSERT'ed at the end of the queue, and MINQ removes and eatin 
the node at the beginning of the queue. 


Theorem: Let P be the uniform probabitity distribution on n-node 
weighted directed graphs in which all edge-weights are zero or ~. For 
any graph GeG., assume (as usual) that edges. of equal weight in 

all adjacency lists are sorted by increasing node index. If 

N,(G) is the number of times the subroutine MINQ. is catled 


in executing S. ‘on G, then, when implemented as described above, 
4 
ED(N.) = a(n") 


Proof: A straightforward argument shows that an average 
graph under this probability distribution . : 

(1) has at least n/3 edges emanating from each node 
and (2) has a path (of weight 0) from every node to every other 


node in the graph. 
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It suffices to show that there is a constant c > 0 such that if 
G is ‘a graph satisfying (1) and (2),. then N,(6). 2 cn’. 

So suppose G satisfies. (1) and (2). Then, in the execution 
of S; on G, node n will be placed in- NEAR at. some point. 
Hence, at some point in the execution we must-have: K.= n; let Jo 
be the value of J at the point (the edge from dp. to on is 
being "examined" at that point). Since jg had at. least n/3. 
edges from it, and since the edge to nis the last edge listed 


in the ig” 


row of A and. W, we know. that Jo: must have been 
placed in NEAR, placed on the queue, and subsequently returned by. 
MINQ at least n/3 times. 


Following the: th 


time that Jo is returned by MINQ, an 
edge emanating from jy is examined, and its endpoint AJ »2) 
is added to the queue (unless that element. had previously been. 
placed on the queue). The: queue discipline implies that Aig 2) 
will be returned by MINQ inbetween suecessive returns. of Jp 


th time that, Jp is--returned by. 


by MINQ, starting from the & 
MINQ and continuing at least until either node n_ is placed in 
NEAR (after at least n/3 - % further returns ‘of: jy by MINQ) or 
until every edge from A(5q>2) has been examined. Thus A( ig 2%) 
is returned by MINQ at least min(n/3 - 2 - 1, n/3) = n/3- 2-1 
times. Since this is true at least for 1s 2 < n/3, the total 


number of calls to MINQ is at least 
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n/3-1 
5 (n/3 - 2 - 1) = a(n?) 


2=1 
and the statement is proved. 


(] Theorem. 


Since this theorem is true for every ie[1:n], a transitive 
closure (or shortest path) algorithm implemented as above will have 


a(n?) average time, contrary to Spira's original assertion. 
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